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Preface 
The CONVEX Family: Systems Methodology 

The architecture of the CONVEX family of 64-bit supercomputers was designed 
and developed as the result of careful insight and in response to real cus- 
tomer demand . 

The dominance of the 64-bit supercomputer as the mainstay of the computer 
market in the 80* s is explained in part by the incessant demand for 
increased processing performance inherent in scientific applications. 

However, for the high-end scientific market, general purpose computers do 
not supply the exceptional performance required. Coupling the host to an 
external array processor improves performance, but such systems are diffi- 
cult to program and are expensive. Realizing this shortcoming and antici- 
pating customer demand, in 1982 CONVEX Computer Corporation made the com- 
mitment to design and develop a family of sophisticated, high-performance 
computers for the 1980 's and beyond. 

At CONVEX Computer Corporation, the concern was to provide the scientific 
market with a computer family which would offer the same performance levels 
as the most sophisticated systems on the market while also providing supe- 
rior software support and programmer productivity tools — all at a price 
favorable to our customers. Responding to customer demand and foreseeing 
the need for the total computer system, CONVEX architects and design 
engineers designed the CONVEX family of computers. 

What resulted was the architecture for the whole CONVEX family of comput- 
ers, and, notably, C0NVEX-1, the world's first scientific 64-bit supercom- 
puter with integrated vector processing capabilities. Offering the best 
performance levels in the minicomputer industry, our computer family com- 
bines proven high-performance architecture with advanced semiconductor 
technology, programmed using state-of-the-art software tools. 

Central to the development of the CONVEX family of computers is the CONVEX 
architecture, where the incorporation of a vector processor as an integral 
part of the system resulted in new performance levels and user benefits. 
These customer benefits were achieved as a result of the dynamic marriage 
of VLSI, the CONVEX instruction set architecture, and interactive, user- 
friendly software development tools. For our customers, these benefits 
include the following: 

* One integrated system means lower total system cost. 

* Programmer productivity is optimized. 
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* Efficient implementation in VLSI and high throughput are 
achieved through the use of a RISC (Reduced Instruction Set Com- 
puter) architecture. 

* A state-of-the-art globally optimizing and vectorizing FORTRAN 
compiler assures excellent programmer productivity and optimum 
speed for the scientific market. 

* The CONVEX family is fully supported by a robust, flexible 
software system that combines efficient implementation with high 
performance in user- friendly distributed systems. 

* UNIX**, the industry- standard operating system, supports all 
user application needs: real time for time-critical applica- 
tions; highly- interactive multi -programming, and networking sup- 
port for distributed environments. 

Finally, the CONVEX architecture itself needs special consideration. 
Designed specifically for the scientific market from a solid base (the 
CONVEX-1) , the architecture prescribes a systems methodology for future 
computers. Our pride in our achievement is derived from the architectural 
design itself: 

* Central to the architectural design of the CONVEX family of 
supercomputers is the integrated vector processor for optimal 
per f ormance . 

* The family supports a full range of fixed and floating point 
data types for scientific applications. 

* The system offers 4 Gigabytes of virtual memory with 2 Gigabytes 
available to each user for the support of massive user programs 
and data. 

* The large, high speed register sets — address, scalar, and 
vectoi guarantee high performance for address calculations in 
parallel with scalar and vector calculations. 



* 



The CONVEX family is developed around a RISC (Reduced Instruc- 
tion Set Computer) architecture for optimum implementation and 
throughput . 

* A clean and powerful multi -level protection mechanism supports 
and separates users, enhancing total system reliability, and 
increasing the performance of operating system functions. 

* Exceptional scalar performance for data management and system 
control functions is assured. 

* Rapid subroutine entry and exit ensure the remarkable execution 
speeds specified in the architectural design. 
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This handbook has been prepared as an invitation to the user to share with 
us in the perspicuous design of the CONVEX family of supercomputers. 



** UNIX is a trademark of Bell Telephone Laboratories, Incorporated. 
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Introduction 

CHAPTER 1 



This document is an architectural specification for the CONVEX family of 
compatible 64-bit supercomputers. The term architecture has been crisply 
defined as "the attributes of a system as seen by the programmer, i.e., the 
conceptual structure and functional behavior, as distinct from the organi- 
zation of the data flow and controls, the logical design, and the physical 
implementation" [Amdahl, 1] . Within this context, an architectural specifi- 
cation defines the instruction set, the structure of the logical address 
space "Logical Address Space" and protection mechanisms, the fault, trap, 
and interrupt facilities, and the I/O space as perceived by the programmer 
and properly embodied by the hardware designer. Specifically, in CONVEX 
machines, the central processing unit is the portion of the machine which 
is defined by this architecture. The document is not meant to convey a 
particular implementation. However, no architectural decisions and trade- 
offs are made without careful consideration of the potential effects on 
both hardware and software. 

Throughout this document, we at CONVEX have presented rationales for many 
design decisions. This explication is included in order to forestall the 
obvious questions and to enable the user to understand the primary design 
motivations. These motivations are: the orthogonal projection of the 
instruction set; the reduction in hardware complexity (especially as it 
relates to performance and instruction decoding- -hence the adoption of the 
RISC architecture) , and the universal guidelines of "garbage in/garbage 
out." A frequently used term for this type of design is perspicuous: sim- 
ple, elegant, and easy to understand. 

In order to introduce the reader to the organization and content of this 
handbook, the following pages contain brief summaries of the book's various 
chapters. The chapters are arranged as follows: 



o Data Types 
o Register Set 
o Logical Address Space 
o Memory Management 
o Protection System 
o Exceptions 
o I/O and Interrupts 
o Instruction Set 
Section 1 1-1 



Introduction 
1.1 Data Types 



There are three generic scalar data types: numeric fixed point integer, 
numeric floating point, and unsigned numeric values. An array structure 
(ordered sequence) is provided for each of these data types. An array has 
four general characteristics: 



o data types 

o rank or dimension 

o length 

o stride 



Refer to Figure 1-1 for a graphic representation of array terminology. In 
this example, A is a 3 by 4 array of words. An array is a data structure, 
composed of elements. In this case, the elements are words. Data types 
are manipulated by instructions defined in the latter sections of this 
handbook . 



1.2 Re gister Sfita 

There are three general register sets and several status registers . The 
three general register sets are: 

o Address registers (8 x 32 bits) 

o Scalar registers (8 x 64 bits) 

o Vector Registers (8 vectors, each 128 elements x 64-bits) 

The registers are partitioned according to the operand to be manipulated: 
addresses (and scalar indices), scalars, and vectors. 
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Figure 1-1: Vector Terminology 
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Introduction 

1.2.1 Eixed Eaini Integer 

The four fixed point integer representations-^, 16, 32, and 64 bits-- (also 
referred to as byte, halfword, word, and longword, respectively) correspond 
to the following FORTRAN lengths: INTEGER*1, INTEGER*2, INTEGER*4, and 
INTEGER*8, respectively. Fixed point numbers use the 2's complement 
numbering system. 

1.2.2 Elaaiina Eaini 

There are two floating point number representations: single precision word 
(32 bits) and double precision longword (64 bits) . These formats are 
interpreted as binary normalized fractions with an implicit "1" bit in the 
most significant bit position of the fraction. The exponent is in biased 
form. 



1.2.3 Unsigned Yalues/i 

An address or logical value is treated as unsigned. Addresses are 32 bits 
in length and reside in the address registers. For numeric purposes, an 
address is treated as an unsigned 32 bit integer. 

1.2.4 Data Type Msmoxy. Alignment 

The CONVEX logical address is byte granular; thus, all of the operands can 
begin on any byte boundary. However, performance may degrade if some data 
types are not aligned on an integral boundary (e.g., 64 bit integer 
operands are aligned if the least significant 3 bits of their byte address 
are 000) . 

1.3 Logical Addr ess Space 



vir- 



The CONVEX architecture offers four Gigabytes (4.3 billion bytes) of 
tual memory in its logical address space partitioned into 8 x 512 megabyte 
segments. Of these 8 segments, 4 are allocated to the user and 4 to the 
operating system: thus, the maximum user program (instructions and data) 
is 2 Gigabytes . 

This allocation means that a user program (instructions and data) written 
in FORTRAN can occupy up to 2 Gigabytes of virtual storage. The operating 
system data structures and instructions necessary to manage the user pro- 
gram occupy the remaining 2 Gigabytes of virtual storage. 
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1.3.1 Memory Management 

The memory management hardware permits the operating system to provide an 
extremely flexible and reliable virtual memory programming environment- As 
mentioned above, the logical address space of the CONVEX architecture is 
virtual; so, even though an address may be a valid logical address, the 
referenced data may or may not be in main or physical memory. Memory is 
managed on a fixed-size page basis. To manage the memory, the CONVEX archi- 
tecture defines and supports several attributes: 



o Segment Descriptor Registei a 32-bit register that contains a 

pointer to the first level page table. 

o Page--a contiguous group of bytes, in particular, 4096. A page 
is both logically and physically contiguous. 

o Page Frame- -a page that is stored in main memory. 

o Page Tables--a page that contains entries called Page Table 
Entries (PTE) . A pagetable begins on an integral page boundary 
and is contained in one pageframe or less. First level page 
tables contain PTE's which have pointers to second level page 
tables; second level page tables contain pointers to physical 
page frames . 

o Page Table Entry (PTE) — a 4 byte entry (32 bits) that conveys 
information to determine, for instance, whether or not a page is 
resident in main memory. 

o Referenced and Modified bits--these determine whether a valid 
memory read or write has occurred. 

o Address Translation Unit--a programmer invisible address cache 
that maintains the most recently used logical to physical 
address translations. 



1.3.2 Memory Protection System 

The architecture of the CONVEX family of supercomputers provides a full 4 
gigabyte virtual address space, supporting very large user programs. In 
addition, the program environment is enhanced by embedding the operating 
system in the user address space. From these vantage points, the user and 
the system benefit enormously. However, to realize these benefits fully, 
the operating system must be protected from the user. The protection sys- 
tem designed into the architecture protects the user, his programs, and 
other user's programs, while also supporting contemporary notions of shar- 
ing and operating system structures. This system is based on hierarchical 
structures called rings and has been designed to: 
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o Support the embedding of the operating system in the user logi- 
cal address space 

o Contain certain violations to the user's process 

o Permit the implementation of the reliable and robust UNIX 
operating system 

o Enhance the performance of operating system call processing by 
reducing the time for context switching 



1.4 Exceptions, L/Q, and Interrupts 

Exceptions are invoked when problems occur in a currently executing program 
(arithmetic inconsistencies or Address Translation Eaults, for example), or 
as a result of some asynchronous event (such as an interrupt) . Control is 
then transferred to some predetermined location, the value of which is a 
function of the exception. 

To the processor, all I/O data references are memory mapped, which means 
that there are no explicit I/O data reference instructions. I/O registers 
and status bits are referenced through the appropriate logical to physical 
address mapping. The I/O register space is 1 billion bytes: in essence, up 
to one billion I/O registers can be referenced. Generally, I/O operand 
references must be on an integral boundary, so that the least significant 
address bits equal to the precision of the referenced operand will be all 
0. 

Interrupts are a result of asynchronously occurring events and belong to 
the system and not to the executing process. When an interrupt occurs, the 
processor will vector to a particular handler as a function of the source 
of the interrupt. 



1.5 Instruction Set 

The CONVEX family of supercomputers offers an extremely powerful instruc- 
tion set designed to provide maximum functionality per instruction con- 
sistent with ease of hardware decode and orthogonality of specification. 
The instruction set is used to generate logical addresses, load, store, and 
manage operands, and manipulate the virtual machine mechanisms. 

A CONVEX instruction is one of three lengths: one, two, or three halfwords 
(equivalent to instructions of 16, 32, or 48 bits in length). Even though 
the fundamental unit of addressability is the byte, instructions are 
addressed on a ha If word boundary. All instructions begin on even byte 
boundaries . 
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1.6 The East of the Document 

The following sections of the document examine the architecture of the CON- 
VEX computer family in detail . 

Chapter 2 discusses the organization of data types and the addressing modes 
available. 

Chapter 3 provides full coverage of the register set. 

Chapter 4 describes in full the protection system. 

Chapter 5 gives an overview of the logical address space and memory manage- 
ment structures . 

Chapter 6 provides details on exceptions. 

Chapter 7 discusses the I/O's and interrupts' capabilities. 

Chapter 8 summarizes the instruction set format. 

Chapter 9 offers a tutorial on the address register instructions and pro- 
vides a detailed description of each instruction available. All entries 
are organized after the order in which they are discussed in the tutorial, 
and each includes a full description of the operation and effects of that 
instruction. Chapters 10, 11, 12, 13, 14, 15, and 16 are organized in the 
same way. 

Chapter 10 covers the scalar register instruction set. 

Chapter 11 describes the program control instruction set. 

Chapter 12 details the privileged instruction set. 

Chapter 13 defines the vector/scalar instruction set. 

Comparisons/mask/merge/compress instructions are covered in Chapter 14. 

Chapter 15 lists the vector reduction instructions . 

Chapter 16 describes the VL, VS, and VM instruction set. 

Appendix A lists notational conventions used in the handbook. 

Appendix B sorts the operand codes by number. 

Appendix C sorts the operand codes by name. 

Appendix D contains the assembler notation. 

Appendix E covers floating point computations. 
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Appendix F is a glossary of technical terms. 

Further Reading 

This handbook describing the architecture of the CONVEX computer family is 
complemented by The. COJEffiX Hardware Handbook. This text details the 
hardware characteristics of each CONVEX implementation and lists all of the 
particulars that would be of interest to a system or application program- 
mer, as well as to a hardware designer. 

Readers should note that a list of notational conventions, glossary of 
technical terms, and complete index are included in the latter sections of 
this text. The notational conventions have been established because a 
baseline methodology and set of consistent definitions are required for a 
proper understanding of this document. Finally, a feedback form is 
enclosed as the penultimate page in the handbook, and readers are invited 
to share with us their observations on the service and clarity of this 
text. 



Notes 

[Amdahl, 1] G. M. Amdahl, G.A. Blaauw, and F. P. Brooks, Jr. "Architecture 
of the IBM System/360." IBM Journal of Research and Development a, 2 (April 
1964), 87-101. 
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CHAPTER 2 

2 Data Types 

There are 3 generic scalar data types: numeric fixed point integer, numeric 
floating point, and unsigned value. An address or logical value is treated 
as unsigned. An array structure is provided for each scalar data type, 
except unsigned. An array is an ordered sequence of any of these scalar 
data types. Arrays have four general characteristics: data type, rank or 
dimension, length, and stride. The rank or dimension is simply the number 
of indices necessary to reference a particular element. The length is the 
total number of elements in the entire array. For example, an array with 10 
rows and 10 columns is a rank 2 array that has 100 elements. The length of 
the array is limited by the compiler and the logical address space. The 
stride is the distance in bytes between adjacent array elements along the 
same dimension. For example, for a one dimension word vector, the stride is 
4 bytes. From the user's perspective, data types supported by the processor 
are specified below. 

In the appropriate chapters, instructions are defined which manipulate the 
data types presented in this chapter. Unless otherwise specified mixed mode 
arithmetics on data types or manipulations on operands in registers must 
rigorously follow the conventions provided. Any attempt to circumvent these 
conventions through knowledge of an internal representation can be 
dangerous and is not recommended. If conventions are circumvented, it is 
not guaranteed that results can be reproduced from one implementation to 
another . 



2.1 Elxsd. Eaint Integer 



There are four fixed point integer representations: 8, 16, 32, and 64 bits. 
These numbers are referred to as; byte, halfword, word, and longword 
respectively. These numbers also correspond to the following Fortran 
lengths: INTEGER*1, INTEGER*2, INTEGER*4, INTEGER*8 respectively. 

Fixed point numbers use the 2's complement numbering system, which means 
that the most significant bit, the sign bit, has a value equal to (where n 
is the bit position within the number num<n>) : 

(-1) * (num<n>) * (2** (n) ) . 

All other bits have a value equal to (num<n>) * (2** (n) ) . The format of these 
four fixed point data types is shown in Figure 2-1. 
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Figure 2-1: Fixed Point Integer Representations 



Byte 



Halfword 



15 



Word 



31 



Longword 



63 

Note: S represents the sign bit. 



A scalar fixed point integer or unsigned value can be loaded into one of 
two types of registers (the register set is described in a later chapter) : 
the address or scalar. Generally, operands that are to be used as an 
address or index value, or that are to be manipulated in parallel to a com- 
putation performed in the scalar or vector registers, are loaded into the 
address registers. The address registers are 32 bits in length. Thus, long- 
word operands cannot be directly loaded into an address register. Operands 
that are to be used for numeric processing only are generally loaded into 
the scalar registers. 



When operands with a precision less than the destination 

loaded, the remaining bits of the register are unchanged. For example 

when a 16-bit integer is loaded into a 32-bit A register, the higher 

16 bits of the A register (A<32. .16>) are unchanged. 



register are 
order 



Thus, a byte is loaded into bits<7..0> of a register; a halfword is loaded 
into bits<15. .0> of a register; a word(integer or single precision) is 
loaded into bits<31..0> of a register, and longword is loaded into 
bits<63..0> of a register. 

Because of this register and operand (address and scalar) partition, pro- 
cessor architectures can be constructed which allow for asynchronous and 
overlapped fetch and execute units. Consequently, all address calculations 
can be done in parallel to numeric calculations. This feature is highly 
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desirable and provides for increased performance. 



2.2 



Efflint 



There are two floating point number representations: single precision word 
(32 bits) and double precision longword (64 bits) . These formats and 
their interpretation follow the CONVEX F-format single and CONVEX G-format 
double precision architecture. These formats have normalized binary frac- 
tions and biased binary exponents. The fractions have an implicit "1" bit 
in the most significant bit position. The format of the single precision 
(32 bit) floating point number is: 



where : 



| S | Exponent | Fraction | 

31 30 23,22 



S = the sign bit. A binary value of denotes 
positive. A binary value of 1 denotes 
negative. Numbers in this form are termed 
sign and magnitude. 

Exponent = A binary biased exponent. That is, 
the decimal value of the exponent is 
obtained by evaluating the unsigned binary 
value of bits<30. . 23> . Then 128 is subtracted 
from this value. This value is then used as a 
power of 2 . 

Fraction = A fractional value. An implicit 1 bit 
is to the left of bit 22. The decimal point is to 
the left of the implicit 1 bit. 



The range of a single precision operand is approximately 10** -38 through 
10**+38. 

The format of the double precision (64 bit) floating point number is: 
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| S | Exponent | Fraction | 

63,62 52,51 



where : 



S = the sign bit. A binary value of denotes 
positive. A binary value of 1 denotes 
negative. Numbers in this form are termed 
sign and magnitude. 



Exponent = A binary biased exponent. That is 
the decimal value of the exponent is obtained 
by evaluating the unsigned binary value of 
bits <62..52>. Then 1024 is subtracted from this 
value. This value is then used as a power of 2 

Fraction = A fractional value. An implicit 1 bit 
is to the left of bit 51. The decimal point is to 
the left of the implicit 1 bit. 

The range of a double precision operand is approximately 10**-308 through 
10**+308. 



2.2.1 Eiflating Eszint Yaluss 

The value of a floating point operand is determined in the following 
manner. The sign of the operand is determined by the sign bit. Sign bit= 
is positive; sign bit=l is negative. The exponent is expressed as a 
biased exponent, which means that a positive number called the bias is 
added to the signed 2*s complement of the true exponent. The following is 
an example of this bias. Assume that an exponent of is desired. Then for 
single precision, the value 128 (the bias) is added to 0. Thus the 
binary exponent string of (1000 0000) represents an exponent of 0. For 
double precision, the bias is 1024. 

The fraction has a binary point to the left of the most significant bit. 
Since the fraction is binary and is always normalized (the most significant 
fraction bit is a 1) for non-zero numbers, the fraction is expanded by one 
bit in the most significant bit within the processor during computation. 
The binary point is to the left of the implicit 1 bit. For computational 
purposes only, the fraction is interpreted as follows: 
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Fraction in register 



22 



.|1 bits <22..0> | Numerical Processing 

23 

Within these formats, there are certain reserved or special operands. In 
particular, these operands are used to represent zero or to initiate an 
exception • 

2.2.2 Zero. 

A zero is a floating point number with a sign of and an exponent of 0. 
The value of the fraction is unimportant. For example, if two floating 
point zeros with different fractions are compared for equality, the result 
is true . 

For all computations that have a result of zero for the fraction, an all 
zero fraction is generated. A floating point zero with a fraction of all 
is called a TRUE ZERO. 



2.2.3 Esseryed. 

A floating point number (single or double) that has a sign bit of 1 and an 
exponent of all is defined as a reserved operand. The value of the frac- 
tion bits is unimportant. 

A reserved operand exception is detected if a reserved operand is encoun- 
tered during a floating point numeric operation (e.g., ADD, SUBTRACT, COM- 
PARE, MAX, and so on) . 



2.2.4 Rounding 

All floating point algorithms use unbiased rounding, denoted as R* . For 
single precision, the processor determines the intermediate results of 
internal calculations by manipulating 26 bits. These bits include 23 frac- 
tion bits; the implicit 1 bit placed at "Unbiased Rounding, Fraction Bits" 
the left of the most significant fraction bit, and two guard bits placed at 
the right of the least significant fraction bit. In addition, a "sticky" 
bit is placed to the right of the guard bits. This bit is used in the 
intermediate calculations of floating point operands, and remembers whether 
or not any binary l's were shifted out to the right during an alignment or 
partial product operation. For double precision, internal calculations use 
55 bits plus the sticky bit. J 

Section 2.2.4 2-5 




A logical address is 32 bits in length, resides in the address registers, 
and has the following format: 



| Logical Address 

31 



An address is treated, for numeric purposes, as an unsigned 32 bit integer; 
thus, an address is always positive. A logical address of all is 
reserved. Note that for any particular implementation, the interpretation 
of an all logical address may differ. 



2.4 Unsupported Baia Types 

While unusual, it is important to know what data types are not supported by 
the instruction set. Byte and bit strings and commercial data types are not 
in the CONVEX architecture. CONVEX is a scientific machine, and, while For- 
tran '77 defines byte strings, it is up to the compiler to handle these 
data types through the appropriate basic instructions. Since the CONVEX 
logical address space is byte granular and all processors will be highly 
pipelined, an in-line string move, constructed of assembly language 
instructions, will be as efficient as a microcoded string move instruction. 
This is the essence of a RISC architecture. 



2 . 5 Data Type M emory Alig nment 

The CONVEX logical address is byte granular. All the operands specified in 
this chapter can begin on any byte boundary, unless otherwise noted in an 
instruction definition. 

However, there may be performance penalties for data types aligned on non- 
integral boundaries. In particular the application programmer and the com- 
piler writer (when applicable), should follow the following alignment 
rules: 

* Byte - No preference 

* Halfword - least significant address bit = 

* Word - least significant 2 address bits =0 

* Longword - least significant 2 address bits = 
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CHAPTER 3 
3 Register Ss£ 



There are three general register sets and additional status registers. The 
registers are partitioned according to the operand to be manipulated: 
addresses (and indices), scalars, and vectors. This partition permits the 
minimal machine state to be associated with the executing program- -be it 
the operating system, compiler, scalar application program, or vector 
application program. 



3.1 Address Registexs/Eragram Cfluatax/ESH 

There are eight 32-bit address registers denoted as AO, A1,...,A7. AO, A5, 
A6, and A7 have predefined meanings (with the exception of AO, they can 
still be used as general purpose address registers) . A5 is implicitly used 
by some complex instructions; A6 is the Argument Pointer; A7 is the Frame 
Pointer, and AO is the Stack Pointer. In addition, AO is interpreted in 
one of two ways. If AO is specified as an addressing mode, then the value 
of is used in place of the true value of AO. When AO is used as a source 
or destination for an arithmetic operation, then the true value of AO is 
used. All other address registers are available for general use. For com- 
plete information on stacks and the registers used to maintain them, see 
Chapter 5, "Logical Address Space and Memory Management". 

Two additional 32 bit registers exist: the program counter (PC) and a pro- 
cessor status word (PSW) . The PC is not part of the eight A register set. 
The separate definition and existence of a PC permits address generation 
not having to concern itself with the true state of the PC. Thus, in a 
highly pipelined processor with instruction overlap, no additional data 
paths or arithmetic logic units need exist to support PC relative address- 
ing in a general manner. 

The structure of the program counter is as follows: 



|SEG| SEGMENT BYTE OFFSET |R| 

3 2,2 

1 9,8 1 



Program Counter 



When the program counter increments to reference the next instruction, the 
bits incremented are a function of bit 31. If bit 31 is a 1, then 
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bits<30..1> are incremented. If bit 31 is a 0, then bits <28..1> are 
incremented. Bit of the program counter is not interpreted but is 
reserved for future use for hardware design. 



3.1.1 Erassssar Status Word 

The other 32 bit register is the user accessible processor status word. 
This register contains flags to indicate the results of numerical opera- 
tions and flags to enable or disable exception processing. There are no 
privileged mode bits in the PSW. The structure of the PSW is as follows: 



|C|AIV|ADZ|IVE|TR|FRL|SEQ|SC|SIV|SDZ|DZE|UN|OV|RO|FDZ|FEIFUE| res | 

33 22 2222 22 2 2111 11 11 

10 98 7654 3 2 1 0987 65 43 

Processor Status Word 



where the bits have the following meanings: 

Bit 31 - C/Carry. The carry bit is set to the carry out for specified 
operations on the A (address) registers. For compare operations using the 
A registers, the sense of the compare is stored in the carry bit (i.e., was 
the compare true or false) . 

Bit 30 - AlV/Overflow. The address register integer overflow bit is set to 
indicate that a fixed point integer overflow occurred for specified opera- 
tions on the A registers. An overflow trap occurs if bit 28 is a 1. If AIV 
is a 0, then no overflow has occurred since this bit was cleared. If AIV 
is a 1, then at least one overflow has occurred since this bit was last 
cleared. 

Bit 29 - ADZ/Divide by Zero. A divide by zero was detected during an opera- 
tion using the A registers. If ADZ is a 0, then no integer division with a 
zero divisor has occurred since this bit was cleared. If ADZ is a 1, then 
at least one integer division with a zero divisor has occurred since this 
bit was last cleared. 

Bit 28 - IVE - Integer Overflow Enable. If bit 28 is a one, and either bit 
22 or bit 30 is a one, an integer trap occurs. If bit 28 is a zero, no 
trap occurs. 

Bit 27 - TR/Trace. A one causes an instruction trace trap before the exe- 
cution of the instruction referenced by the program counter. A non- 
privileged user can set or reset this bit. For trace mode to function 
properly, the user must set the SEQ bit (bit 24) to 1. 

Bits<26..25> - FRL/Frame Length. Indicates whether the frame created by the 

Section 3.1.1 3-2 



Register Set 

last CALL instruction, TRAP, or fault was a short frame (ERL=11) or a long 
frame (FRL=10) , an extended frame (FRL=01) , or a context block (FRL=00) . 
The FRL bits are used by the return (rtn) instruction to unwind the stack 
after a subroutine call or exception . When FRL=00, the current ring must be 
0, and the rtnc instruction must be used. 

Bit 24 - SEQ/Sequential . This bit determines the degree of pipelining 
within the processor. A indicates that maximum pipelining and overlap is 
provided. A 1 indicates that all instructions are executed sequentially. 
That is, the execution of an instruction is initiated only after the previ- 
ous instruction has completed its execution. 

Bit 23 - SC/Scalar Carry. The carry out for operations involving the S 
(scalar) registers. This bit is also used to hold the result for scalar 
comparisons . 

Bit 22 - SIV/Integer Overflow. If SIV is a 0, then no overflow has 
occurred since this bit was cleared. If SIV is a 1, then at least one 
overflow has occurred since this bit was last cleared. 

Bit 21 -- SDZ/Divide by Zero. Indicates the occurrence Of an integer 
divide by zero for a divide on a scalar or vector register. If SDZ is a 0, 
then no integer division with a zero divisor has occurred since this bit 
was cleared. If SDZ is a 1, then at least one integer division with a zero 
divisor has occurred since this bit was last cleared. 

Bit 20 - DZE - Divide by Zero Enable. If bit 20 is a one, and either bit 
21 or bit 29 is a one, a trap occurs. If bit 20 is a 0, no trap occurs. 

Bit 19 - UN/Underflow. Indicates that a floating point underflow occurred 
during an operation on a scalar or vector register. If UN is a 0, then no 
underflow has occurred since this bit was last cleared. If UN is a 1, then 
at least one floating point underflow has occurred since UN was reset to 0. 

Bit 18 - OV/Overflow. Indicates that a floating point overflow occurred 
during an operation on a scalar or vectpr register. If OV is a 0, then no 
operation on a scalar or vector register produced an overflow since this 
bit was last cleared. If OV is a 1, then at least one floating point over- 
flow has occurred since OV was reset to 0. 

Bit 17 - RO/Reserved Operand. Indicates that a floating point operation on 
a reserved operand during an operation on a scalar or vector register has 
been detected. A reserved operand is a floating point operand with a sign 
bit of 1 and an exponent of all 0' s. A 1 indicates that a reserved operand 
has been detected. A indicates that a reserved operand has not been 
detected. If RO is a 0, then no floating point operation on a scalar or 
vector register produced an overflow since this bit was last cleared to 0. 
If UN is a 1, then at least one floating point overflow has occurred since 
RO was reset to 0. 

Bit 16 - FDZ/Floating Divide by Zero. Indicates that a floating point 
divisor of was used during a floating point operation on a scalar or 
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vector register. A indicates that no zero divisor was detected. A 1 
indicates that a zero divisor was detected. If FDZ is a 0, then no float- 
ing point division with a zero divisor has occurred since this bit was 
cleared. If FDZ is a 1, then at least one floating point division with a 
zero divisor has occurred since this bit was last cleared. 

Bit 15 - FE/Floating Point Trap Enable. If bit 15 is a 1, then if any of 
bits PSW<18..16> (Overflow, Reserved Operand, Divide By Zero) are a 1, a 
floating point trap occurs. If bit 15 is a 0, no floating point trap 
occurs . 

Bit 14 - FUE/ Floating Point Underflow Enable. If bit 14 is a 0, a float- 
ing point underflow trap does not occur. If bit 14 is a 1, a floating 
point trap underflow does occur. In both cases, if a floating point under- 
flow is detected, true zero is the result. 

Bits <13..0> - RES/Reserved. These bits are reserved for future system 
use. 



For bits 30, 29, 22, 21, 19, 18, 17 and 16 (AIV, ADZ, SIV, SDZ, OV, UN, RO, 
FDZ) a bit set to 1 stays a 1 unless otherwise cleared by an explicit store 
of the PSW. This permits the PSW to remember the occurrence iaf an exception 
which is masked out, but which is to be subsequently explicitly tested. 

Thus, the logic equation for these exception bits is (using UN as an exam- 
ple) : 

UN = UN .OR. Indication of underflow. 
NOTE: 

For floating point exceptions, two trap enables are provided: one for 
underflow and one for every other floating point exception. The reasoning 
for this relates directly to the possibility of continuing computation 
after the trap. Underflow forces a true zero result, which for most cir- 
cumstances, is sufficient. All other floating point exceptions force a 
reserved operand result. Reserved operands are generally markers for future 
trap handlers. Two trap enables permit the application programmer to 
choose the appropriate reaction to a floating point exception. 

Please see the appropriate sections on arithmetic exceptions, subroutine 
calls, and system calls for a description of the modification of the PSW. 

3.2 iqh and YY Eiaas 

There are two privileged binary flags, ION and W, which control certain 
operations. The ION flag is used by the operating system to enable and 
disable external interrupts. There are instructions to test the state of 
the ION flag (bri.f, bri.t, jmp.f, and jmp.t) . There are also privileged 
instructions to disable ION or set it to (dsi) , and to enable interrupts 
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or set ION to 1 (eni) . 

The W or vector valid flag is also used by the operating system to control 
the saving and restoring of the vector accumulators in a demand mode. Typ- 
ically, when a program first uses vector instructions, a vector valid trap 
occurs, and the operating system will then allocate vector register 
(VM,VL,VS, and Vector Accumulators) to the user program. There are two 
instructions which operate on the W flag. The privileged instruction mov 
Sk,W loads the W flag from Sk. The other instruction, tstvv, loads the 
value of W into the SC bit in the PSW. Please see Section 6.3.3 for 
details on the vector valid trap. 



3 . 3 Sequential Execution 

When bit 24, SEQ, of the PSW is set, all pipelining is disabled. Owing to 
the pipelined characteristics of the processor, execution for debugging 
purposes (both for hardware and software) must sometimes be serialized. 
The numerical results produced are the same regardless of the setting of 
the SEQ bit. Only performance and the serial nature of the execution are 
affected. The user may freely set or reset this bit. 



3.4 Data &££un 

There are two general sets of data registers: scalar and vector. Within 
the vector register set, there are four types: vector accumulators (V) , 
vector merge (VM) , vector stride (VS) , and vector length (VL) . 

Within the S and V registers, the architecturally supported data types 
occupy the following bit positions: 



* BYTE - bits<7 . . 0> 

* HALFWORD - bits<15 . . 0> 

* WORD - bits<31..0> 

* LOGICAL - bits<63..0> 

* LONGWORD - bits<63..0> 

* SINGLE PRECISION - bits<31..0> 

* DOUBLE PRECISION - bits<63..0> 



3.4.1 



There are eight 64-bit scalar accumulators or S registers. The S registers 
contain either fixed point integer, logical, or floating point operands. 
When an operand less than 64 bits is loaded into a scalar register, the 
unused bits are left unchanged. The scalar registers are referenced as: 
SO, SI S7. 
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3.4.2 Vector 

3.4.2.1 Yectar Accuouiatars ■ 

There are eight vector accumulators (V) , each of which can contain up to 
128 64-bit operands. These operands can be: integer, logical, or floating 
point. When an operand less than 64 bits is loaded into a vector accumula- 
tor, the unused bits are left unchanged. 

Since a vector accumulator can contain up to 128 elements of the same data 
type and precision, a means is provided to specify the exact number of 
operands stored. The VL or Vector Length register provides this function. 

The vector accumulators are referenced as: VO, VI, ..., V7. Individual 
elements within a vector accumulator are referenced by appending the ele- 
ment number to the vector accumulator designation. Thus, the 22nd element 
of VI is referenced as VI (21) , and the first element of VI is VI (0) (origin 
indexing) . 

3.4.2.2 Vector Merge 

To support efficient element-by-element array comparisons and array manipu- 
lations such as compress, expand, and merge, a Vector Merge or VM register 
is provided. VM is 128 bits in length, with one bit position for each pos- 
sible element in an array accumulator. 

A binary 1 is used to contain the results of compare operations when those 
comparisons are true; likewise, a binary is used to contain the results 
of compare operations when those comparisons are false. 

Typical uses of VM (as supported by the instruction set) are: 

1 Vector Clipping 

2 Population Count (the number of successful compares) 

3 The manipulation of sparse vectors. 

4 Array compression, expansion, and merging. 

5 The number and location of zero or threshold crossings. 

6 Support arithmetic operations that are performed under mask. 

3.4.2.3 Victor Stride 

A 32 -bit register, VS, is provided to specify the distance, in bytes, 
between adjacent array elements. If VS is positive, adjacent array ele- 
ments are loaded and stored from memory by adding a multiple of VS to the 
initial address of the array base. If VS is negative, adjacent array ele- 
ments are loaded and stored from memory by subtracting a multiple of VS 
from the initial address of the array base. In this latter case, logically 
adjacent elements are in decreasing locations in logical memory. 

If VS is smaller than the precision of the operands fetched, undefined 
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actions can occur. If VS is exactly 0, the referenced operand is extended 
to a vector whose length is equal to VL (scalar extension) . 

3.4.2.4 

An 8-bit register, VL, is provided to specify the number of elements 

contained in a vector accumulator. Since VL is eight bits in length, up 

to 255 can be specified, but only 128 elements can reside in a vector 
accumulator. The VL register must have: 

1 A specification of 128 elements or less. 

2 An efficient means for compiler support of arrays greater than 
128 where the length of the array is an integer multiple of 128 

(often called strip mining or sectioning) . 

3 An efficient means for compiler support of arrays greater than 
128 where the length of the array is not an integer multiple of 
128. 

4 The provision of one mechanism for handling loop control for the 
cases where the iteration count of a program loop is a variable. 



Note: when VL is zero, no vector operation is performed. With these objec- 
tives in mind, the interpretation of VL is exactly equal to its decimal 
value. 

Examples: 

1 VL = 1000 0000. VL represents the value 128. 

2 VL - 0000 0111. VL represents the value 7. 

3 VL = 0000 0000. VL represents the value (no-operation) 



An attempt to load VL with a value greater than 128 will result in the 
decimal value of VL being exactly 128. 

The following code sequences would suffice for handling an array of any 
length: 

1 Load VS. 

2 Load VL. 

3 Execute the vector instruction sequence. 

4 Subtract 128 from the value used to load VL in step 2 . 

5 If the resulting value is less than or equal to 0, then all of 
the array elements have been manipulated, and thus exit. Other- 
wise : 

6 Adjust the address pointers to the array section (the next 128 
elements) or unroll the DO loop in-line. GOTO step 2. (Note: VS 
need not be reloaded) . 

Please note that even though the VL register is 8 bits, a vector can be of 
any arbitrary length up to the user logical address space of 2 Gigabytes. 
As mentioned above, the compiler performs operations on vectors in groups 
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f 128 elements, through a mechanism called sectioning or strip mining. 



o 
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CHAPTER 4 
4 Era£e.g£jo.n System 



The protection system protects the user, his programs, and other user's 
programs, while also supporting contemporary notions of shared resources 
and operating system structures. These features do not get in the way of 
system performance or cause undue hardware complexity. 

The protection system has been designed to: 

1 Support the embedding of the operating system in the user logi- 
cal address space (for reasons of performance and software reli- 
ability) . 

2 Contain certain violations to a user's process. A user can and 
may modify his/her own process, but not any others. 

3 Detect runaway programs in a graceful manner. 

4 Permit efficient implementations of virtual machine mechanisms. 

In order to achieve these objectives, the protection system is structured 
in the following manner. The logical address space is partitioned into 5 
hierarchical areas called rings. This partitioning is defined by the seg- 
ment field (bits <31..29>) of the logical address. Segment is always 
assigned to ring 0, which contains the operating system kernel. A set of 
instructions, . referred to as privileged instructions, can only be executed 
in ring 0. Segment 1 is always assigned to ring 1. Segment 2 is always 
assigned to ring 2 . Segment 3 is always assigned to ring 3 . Segments 
4,5,6, and 7 are always assigned to ring 4. The structure of the logical 
address space is graphically shown in Figure 4-1. 
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Figure 4-1: Logical Address Space Structure 
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This particular structure was chosen because the CONVEX family is a 64-bit 
supercomputer which supports a 32 -bit address space. As a result, the com- 
puter can easily support large user programs in a virtual address space. 
Furthermore, the Operating System (OS) can be embedded in the user address 
space. From these vantage points, the user and the system benefit enor- 
mously. However, to realize these benefits fully, the OS must be protected 
from the user . 

Another advantage to this structure is the segmentation support made avail- 
able for users. Segmentation may be used to support the mechanisms of dif- 
ferent address partitions for user code, static data, dynamic data (stack 
or unshared), and system library code. To these ends, four segments are 
explicitly provided for the user. 

4.1 Legical Address Space Structure. 

Two other structures are needed to complete the protection system: the 
access field contained within a pagetable entry (PTE) , and the access 
brackets pertaining to the enforcement of the ring structure. 



Section 4.1 



4-2 



Protection System 

The access bracket structure directly implements a mechanism called ring 
maximization. Ring maximization means that given a lower access priority, 
the references of the instruction are always at this lower priority. The 
reference starts at the priority implied by the ring field of the program 
counter (bits <31. -29>) . This initial ring is then compared to the ring of 
the referenced operand (if one exists) . As a function of the numerical 
relation between these two ring numbers, a statement concerning the valid- 
ity of the reference can be made. In this ring mechanism, higher ring 
numbers have lower priority than rings with lower numbers. 

A memory reference that satisfies the ring maximization function is valid. 
All valid references must then satisfy the access requirements imposed by 
the access field of the PTE that references the target operand. 

Table 4-1 defines the validity of logical address references. I f an invalid 
reference is detected, a system exception occurs, and an error code is 
loaded into A5 while the fault is being serviced (see Chapter 6, "Excep- 
tions") . The effective source is the ring of the program counter, and the 
effective target is the ring of the address of the referenced operand. If 
indirect addressing is specified, the effective source remains the ring of 
the program counter, and the new effective target is the ring number con- 
tained in the indirect pointer. 



Table 4-1: Ring Maximization Source/Target 
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To determine whether or not a read, write, or execute access should be 
allowed for valid references, the system follows these procedures based on 
an access field within a PTE: 
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1 Read Access - The ring maximization function is used to deter- 
mine whether or not the reference is valid. If it is, bit 3 of 
the valid PTE is examined. If bit 3 is a 1, the read is permit- 
ted. If a 0, the read is not permitted and a system exception 
occurs . 

2 Write Access - The ring maximization function is used to deter- 
mine whether or not the reference is valid. If it is, bit 2 of 
the valid PTE is examined. If bit 2 is a 1, the write access is 
permitted. If a 0, the write is not permitted, and a system 
exception occurs . 

3 Execute Access - If transfer of control within the same ring is 
performed, bit 1 of the valid PTE is examined. If bit 1 is a 1, 
instructions can be fetched and executed from this page. If bit 
1 is 0, instruction execution is not permitted, and a system 
exception occurs . 

The protection mechanisms provided for inter-ring transfer of control (not 
within the same ring) are presented in a subsequent section of this 
chapter . 



4.2 Erofction Natss 

The following notes will help users avoid some pitfalls when using this 
type of protection system: 

1 PC relative addresses are granted no special privileges. The 
appropriate read, write, and execute privileges, as previously 
specified, apply. 

2 Access checking is performed if and only if the pagetable entry 
associated with a valid address is valid. The state of the 
resident bit for checking access privileges is ignored. Thus an 
access violation can be detected for non-resident pages. 

3 If an access privilege is changed for a process after that pro- 
cess has already established a context in the ATU, the ATU must 
be purged upon completion of the alteration. ATU entries are not 
altered automatically when a PTE is modified. 

4 If an instruction specifies an immediate operand (e.g., Add 
immediate) , the read access privilege of the page containing the 
immediate operand is not interpreted; it is treated as an exe- 
cute access. 

5 A ring check is not performed for instructions which produce 
effective addresses, but which do not immediately use them. For 
example, if a load effective address instruction executed in 
ring 3 develops a ring 1 address, no ring violation occurs. If 

Section 4.2 .. 



Protection System 

that ring 1 address is subsequently used by a ring 3 program to 
make an operand reference, a ring violation occurs. 

6 The intermediate addresses of all instructions which can make 
multiple memory references (e.g.. Vector Load) are always ring 
maximized with the current ring to determine the validity of the 
reference (i.e., the address of each array element). 

7 When indirection is specified, the page containing the indirect 
pointer must permit read access. This read access is indepen- 
dent of the instruction type (i.e., load, store, jump) 

8 I/O space operands must be addressed as single bytes. If a valid 
I/O reference is made using a non-byte operand, a protection 
violation occurs. 

The protection structure uses a construction called effective source, which 
has these properties if indirection is specified: 

1 The program counter is still the effective source. 

2 The second target is the ring number contained within the 
indirect pointer. The first target is the ring number contained 
within the effective address that references this indirect 
pointer . 



4.3 £DR Validity Bit Protection 

If bit 31 of an SDR location is 0, a PTE violation occurs. The PTE viola- 
tion is vectored through byte address 0C (hex) in page of ring (a sys- 
tem exception) . To protect against an invalid SDRO which would cause an 
endless PTE violation, the following special test is performed: if SDRO is 
invalid, a machine exception occurs (see Chapter 6) . The response to a 
machine exception is implementation-dependent. 



4.4 Inter -cina Erasedure Call/Return 

Ring crossing can only occur as a result of an explicit attempt by a pro- 
gram control instruction to cross rings, or by a system exception. The con- 
ditions by which these explicit instructions can cross rings are as fol- 
lows: 

1 The explicit program control instruction is either a system call 
(sysc) , return (rtn with FRL=01, extended return block) , or the 
privileged instruction: return from context block (rtnc) . All 
other program control instructions stay within the current ring 
of execution (i.e., the ring of the program counter) . Thus the 
appropriate higher order bits of the target effective address 
are, in essence, ignored. 
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2 The direction of a subroutine system call is inward, toward ring 
0. Outward calls are trapped as a ring violation. The direction 
of all subroutine system returns is outward, away from ring 0. 
Inward returns are trapped as a system exception. 

3 The immediate field of the system call instruction is inter- 
preted as an index into a table within the called ring. This 
index is referred to as a gate number. The table contained 
within the called ring is referred to as a gate array. The base 
of the gate array is pointed to by byte address 4C (hex) of page 
of the called ring. 

The structure of the gate array is illustrated in Figure 4-2. 



Figure 4-2: Gate Array Structure 
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The format of the sysc instruction is: 



| Opcode [L | 000 | Ak | | #r | | 
15 7 6 5 4 3 31 29 28 16 15 



#g 



Inward ring crossing functions in the following manner. The gate index 
field (g field) of the sysc instruction is used to indicate the desired 
entry point. This gate index field is compared with bits <31..16> of the 
first word of the gate array pointed to by byte address 4C (hex) of page 
of the target ring. If the gate index field is greater than or equal to 
bits <31..16>, then a ring violation occurs, and the ring crossing does not 
occur (the gate is not defined). If the gate index field is less than bits 
<31..16>, the ring number of the segment containing the sysc instruction 
(current ring) is compared with bit <31..29> of the referenced gate index 
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(r field) . If the current ring is less than or equal to the bracket (Brae) 
field, then bits <28.,.1> of the gate are loaded into the program counter. 
Bits <31..29> are loaded with the target ring. If the current ring is 
greater than the bracket field, the PC is not loaded, the ring crossing 
does not take place, and a system exception occurs. 

For example, assume that the operating system kernel has n gates. All the 
gates other than gate M are reserved for calls from rings 3, 2, and 1. How- 
ever, gate M (owing to the nature of this kernel call) can be directly 
called by ring 4. All gates in the kernel other than M have the value 3 in 
their gate bracket field. Gate M has the value 4 in its gate bracket 
field. If a ring 4 caller attempts to call a kernel gate other than M, the 
call fails (4 is greater than 3). If a ring 4 caller attempts to call ker- 
nel gate M, the call succeeds (4 is less than or equal to 4) . 

This mechanism permits individual segments to have entry points with unique 
gate brackets. Thus a particular operating system call can be restricted to 
a particular ring of origin. 

All these actions are performed by the processor. There is no software 
overhead or involvement by the operating system kernel, unless an explicit 
kernel call is made. 



4.4.1 Stack Switching/Arguasnfc Eslexencs 

There is one stack per ring, which means that the stack allocated to ring 4 
is logically different from the stack for ring 3, for ring 2, and so on. 

After a successful gate entry, as specified in the previous section, a new 
stack frame is created in the target ring, and an extended subroutine 
return block is pushed onto the target stack (the called routine's stack). 
The stack pointer of this stack is initially loaded from byte address 48 
(hex) of page zero of the called ring. After the extended return block is 
pushed, the stack pointer (AO) is copied into the frame pointer (A7) . The 
PC is loaded from the referenced gate. 

The stack pointer value saved in the extended return block represents the 
value of the caller's stack pointer at the time of the call. It is neces- 
sary to save this value to permit a proper return from a multiplexed stack 
structure. Thus the link back to the outer ring's stack is contained 
within the extended return block pushed on the inner ring's stack. Addi- 
tional details for an inward system call are covered in the description of 
the sysc instruction. 

Arguments for the system call are maintained in a programme! — defined area. 
They may be in an argument packet or on the stack- -whatever software- 
enforced conventions permit. 

The converse of a system call is a system return, which is implemented with 
a return (rtn) instruction. Unlike a system call, no gate processing is 
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necessary. An inner ring can unconditionally access an outer ring, so pro- 
tection is unnecessary. A system return is like a normal return with the 
following four differences. First, the ring field of the Program Counter 
can change. Second, all returns must be the same ring or outward (away from 
ring 0) . Third, the return block on the stack must be an extended or con- 
text type. Fourth, after the return block is popped from the stack, the 
updated stack pointer of the inner ring is restored to byte address 48 
(hex) of page zero of the ring containing the rtn instruction. This guaran- 
tees that, with subsequent system calls to the same ring, the stack will be 
initialized to the proper values. 



4.4.2 Iroian Horse Pointe rs 

Trojan horse pointers can occur on system calls when a passed pointer 
references the operating system's data space. Usually, the software agent 
invoked as part of the inward ring call uses a passed pointer as part of 
system call processing. The agent expects these pointers to reference the 
logical address space of the caller (i.e., the ring of the user executing 
the sysc instruction) . If a passed pointer references an agent's data 
space, unexpected (and generally undetected) disasters occur. To prevent 
such happenings, the following facilities are provided: 

1 An instruction which checks to see that the ring maximization 
function is satisfied for passed pointers (compare immediate) is 
provided . 

2 A load physical instruction is provided to obtain the access 
bits of appropriate pagetable entries. 

3 Instructions which access data backwards (decreasing logical 
memory) always perform the ring maximization function to ensure 
that a dynamic Trojan horse pointer is not created. 

All of these actions can occur outside the operating system kernel. One of 
the objectives of the protection and memory management structure is to 
reduce the size of the operating system's kernel. This permits a more 
reliable and secure kernel to be constructed. Additionally, virtual 
machine structures are easier to construct. 

Generally, there is no algorithm which guarantees that Trojan Horse 
pointers will not occur. Experience has shown that, for a robust system 
call interface, arguments should be copied into the called ring's space, 
and then Trojan horse pointer checking initiated. 

If the arguments are values, the level of required checking is somewhat 
mitigated. This provision prevents one argument pointer from modifying 
another argument after Trojan horse checking; this is essentially the 
self -modifying argument attack. 
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CHAPTER 5 
5 Lexical Address Scacs and Memory Manasement 



The logical address space of CONVEX is virtual. This means that although an 
address may be a valid logical address, the referenced data may or may not 
be in main or physical memory. To manage this structure, various entities 
are defined and supported. Among these structures are: 



1 Segment - A logically contiguous group of bytes--in particular, 
512MB (549,755,813,888 bytes) . 

2 Page--A contiguous group of bytes, in particular, 4096 bytes. A 
page is both logically and physically contiguous. 

3 Segment Descriptor Register (SDR) - -A 32 -bit register that con- 
tains information necessary to translate a logical segment 
offset to a physical address in main memory. 

4 Page Frame — A page that is stored in main memory. 

5 Page Table- -A Page that contains entries called Page Table 
Entries. A pagetable begins on an integral page boundary and is 
contained in one page frame or less. 

6 Page Table Entry (PTE) —A 4 byte entry (32-bits) that conveys 
information necessary to determine if a page is resident in main 
memory or not. Other status bits within a PTE determine the 
validity of the memory reference from a protection viewpoint. A 
PTE is aligned on an integral word boundary. 

7 Referenced Bit- -A bit associated with a page frame. A referenced 
bit indicates that a valid read or write has occurred. 

8 Modified Bit--A bit associated with a page frame. A modified bit 
indicates that a valid write has occurred. 

9 Address Translation Unit (ATU) - -A programmer invisible address 
cache that maintains the most recently used logical to physical 
address translations. 



Sufficient referenced and modified bits exist for the total amount of phy- 
sical memory for a particular implementation. These bits are stored in 
internal machine state and mapped into the I/O address space. 

CONVEX' s logical .address space is 4 Gigabytes (approximately 4.3 billion 
bytes). This space is partitioned into 8 x 512 Megabyte segments. The 
format of the logical address space is: 
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|SEG | Segment Byte Offset | 

31 29,28 o 



There are no other hardware or architecturally supported pointer formats. 
Of these eight segments, four are allocated to the user (essentially 
hardware enforced) , and four to the operating system. Thus the maximum 
user program (instructions and data) is 2 Gigabytes. This permits a user to 
share a common subroutine library in one segment, code in another segment, 
static (e.g., FORTRAN common) in the third segment, and dynamic (e.g., 
stack) in the fourth segment. This partitioning is a suggestion, not an 
enforced algorithm. 

Logical addresses are generated through the use of the program counter, 
absolute references, or one of the eight address registers. 

Data referenced by a byte logical address can begin on any arbitrary byte 
boundary. That is, a 64-bit operand can begin on any one of eight byte 
boundaries. The byte address generated by an instruction references the 
first byte (byte 0) of an operand. However, where storage allocation is 
controlled by the system, the preferred boundary is as follows: 

o byte - not applicable. 

o halfword - least significant address bit is a 0. 

o word - least significant 2 address bits are 00. 

o longword - least significant 2 address bits are 000. 

The software linker must support, for performance reasons, alignment of 
various structures on programmer specified boundaries (e.g., the page boun- 
dary) . 



5 . l Indj.rscU.oja 

All instructions which reference memory can optionally specify a mechanism 
called indirection. Indirection causes the logical address of the refer- 
enced operand to be contained in a memory location. The site of this memory 
location is obtained from the instruction specifying indirection. 

Only one level of indirection can be explicitly specified by an instruc- 
tion. The format of the indirect word is a byte pointer. 

5.2 Siasks 

A stack is an array of bytes organized in a "pushdown" manner. Stacks are 
used to contain operand temporaries and local variables, as well as to 
state information relevant to the environment of the currently executing 
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program. 

There are three architecturally- defined registers that are used to maintain 
a stack: the stack pointer (SP, AO) , the argument pointer (AP, A6) „ and the 
frame pointer (FP, A7) . As a function of the type of operation performed 
on the stack, one. two, or all three of these registers are affected. Gen- 
erally, subroutine entry and exit use all three registers. The following 
subsections describe some of the types of operations performed on a stack. 
Please consult the appropriate instruction set chapter for specific 
details. 



5.2.1 Euan and. Eap. Operands 

There are two primitive operations on a stack. One operation is a push and 
the other operation is a pop. A push stores an operand on the stack; a pop 
removes an operand from the stack. Address register AO points to the top 
element of the stack (the last location used) . A push decrements AO by a 
multiple of 4 or 8; a pop increments AO by a multiple of 4 or 8. 

Pushing a word requires that AO be decremented by 4; then the word is 
stored in the location referenced by the new value of AO. Popping a word 
from the stack requires that the top element is fetched from memory, and AO 
is then incremented by 4. 

The following example depicts these actions. Initially the top of stack is 
at byte 68 (decimal) . Thus, a load word from the top of the stack fetches 
bytes 68, 69, 70, and 71. Pushing a word onto the stack requires that the 
stack pointer first be decremented by 4 (64 = 68-4) ; then the word to be 
pushed is stored into bytes 64, 65, 66, 67, or simply the word that begins 
at byte 64. This is illustrated in Figure 5-1. 



Figure 5-1: Push and Pop Operands 



AO after push ->| 64 | 65 | 66 | 67 |<-New top of stack, AO is 64 
AO before push-> | 68 | 69 | 70 | 71 |<-Prev. top of stack, AO is 68 



Stack 



The stack is managed as an aggregate of 32-bit words, which means that all 
instruction set primitives that manipulate the stack do so by incrementing 
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or decrementing by stack and frame pointer by units of four. Even though 
the stack is referenced by a byte address, this convention is always 
obeyed. Overt modification of the stack pointer (by instructions which 
manipulate AO and A7) by quantities other than multiples of four is not 
recommended. Even though the processor will continue to function, perfor- 
mance will be lost. The stack should be initialized to begin on an integral 
4-byte address boundary. 

There is no explicit stack overflow or stack underflow detection performed 
by the hardware. Stack overflow and underflow may be detected by surround- 
ing the allocated stack by pages (described in a later chapter) of no 
access. Software reserved bits in the protection fields of the no access 
page table entries may be used to differentiate this type of access viola- 
tion from other possible causes. (Page Table Entries are described in 
Chapter 4) . Consequently, the protection trap handler can determine the 
reason for its invocation. 



5.2.2 Creating and Deleting a Stack Exams 

Stacks are generally used as dynamic storage, storage that is allocated and 
deallocated during the execution of a user program. To assist in the 
proper management of the stack, a frame pointer is defined. The frame 
pointer, A7, provides for dynamic linkage between frames contained on a 
stack. Typically, a frame consists of an area that contains saved copies of 
registers from the previous execution context, an area that contains 
storage for temporary variables local to this context, and values necessary 
to manage the present frame as well as a link back to the previous frame. 
See Figure 5-2 below for details on the stack structure. 



5.2.3 Sing Q Stack 

The ring stack (the stack associated with the highest priority ring, ring 
0) must always be aligned on a 32-bit word boundary. If it is not, a 
machine exception occurs (see Chapter 6) . 

5.3 Reserved Lagical Memory 

Reserved logical memory locations are used to obtain addresses or status 
when exceptions occur. Generally when one of these conditions occurs, an 
implicit subroutine call occurs. The processor provides the subroutine call 
opcode, and the reserved area in memory provides the address. Because a 
stack has already been defined, arguments may be passed and a handler rou- 
tine executed. 

The reserved area in logical memory is the first page in the segment refer- 
enced by the ring field of the program counter. This page is referred to as 
Page 0. Since there are five rings, there are five page O's. For ring 4, 
page is always in segment 4. The only page that must be memory resident 
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Figure 5-2: Stack Structure 
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is page of ring 0. 

Page is used in one of two ways, depending on the classification of 
exception (trap or fault) which has occurred. (Exceptions are detailed in 
Chapter 6.) The two types of exceptions which access Page are process 
exceptions and system exceptions. In addition, interrupts also access Page 
0. 



Process exceptions are characterized by the fact that they are handled by 
the user program in the current ring of execution. Examples include arith- 
metic traps and instruction trace. This approach supports the PL/1 "ON" 
condition and anticipated revisions to EORTRAN. It also permits each user 
to have his or her own debugger. The operating system is not involved in 
handling process exceptions. 

A system exception involves the operating system. Examples include address 
translation faults and ring-crossing traps. When such an exception arises, 
a ring crossing to ring occurs, and the ring process stack is used. 
Rings are discussed in more detail in Chapter 4, "Protection System." 

Interrupts also cause a ring crossing to ring 0. An interrupt is an asyn- 
chronous event which the operating system must handle. A unique stack in 
ring is established for interrupts; this stack is different from the 
ring process stack. 



5.3.1 Systsm Eags Q 

Table 5-1 shows the logical memory organization of Page of Ring 0. See 
Chapter 6, "Exceptions," for definitions of the terms "exception " "trap " 
and "fault." 
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Table 5-1: Page Logical Memory Organization 



BYTE 
ADDRESS 
(hex) 31 





4 

8 

C 

10 

14 

18 

1C 

20 

24 

28 

2C 

30-3C 

40 

44 

48 

4C 

50 



16,15 



Reserved 



inter. Level 



Reserved 



I/O Interrupt 



System Exception Handler 



Interval Timer Interrupt 



Reserved 



Reserved 



Vector Valid Trap 



Interrupt Stack Pointer 



Context Stack Pointer 



Reserved 



Previous Stack Pointer 



Reserved 



Instruction Trace Trap 



Arithmetic Exception Trap 



Stack Pointer 



Segment Entry Point 



Breakpoint Trap 



A 



System 



i 



Process 



Section 5.3.1 



5-7 



Logical Address Space and Memory Management 

Each of the above entries has the following meanings. 

Reserved. Should not be used by software. May be used in the 
future . 

1 Interrupt Level. A 16-bit memory-based counter that indicates 
the number of nested interrupts currently being processed. If 
Interrupt Level is 0, then no interrupts are being processed. 
If Interrupt Level is not 0, then interrupts are being pro- 
cessed, and the ring stack is the interrupt stack. 

2 I/O Interrupt. A byte pointer to the handler for I/O inter- 
rupts . 

3 System Exception Handler. A byte pointer to a system exception 
handler. The exceptions that transfer control to this system 
exception handler are: error exit trap; undefined opcode trap; 
ring violation; PTE violation, and non-resident page. 

4 Interval Timer. A byte pointer to the interrupt handler that 
responds to an interval timer interrupt. 

5 Reserved. 

6 Reserved. 

7 Vector Valid Trap. A byte pointer to a trap handler that 
responds to the vector valid trap. A vector valid trap occurs 
if an attempt to execute a vector instruction occurs and the 
vector valid bit is 0. A vector instruction is an instruction 
which manipulates the V, VL,VS, or VM registers. 

8 Interrupt Stack Pointer. A byte pointer that specifies the 
stack to be used when an interrupt occurs. 

9 Context Stack Pointer. A byte pointer that specifies the stack 
to be used when a system exception occurs. 

10 Reserved. 

11 Previous Stack Pointer. A save area used for interrupt process- 
ing. When an interrupt first occurs and the ring stack is 
initialized to the value of the interrupt stack pointer, the 
process stack pointer is saved in byte address 2C (hex) . This 
ensures that there is a proper linkage for stack switching in 
ring for interrupt processing. 

12 Reserved. 

13 Reserved. 

14 Reserved. 
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15 Reserved. 

16 Instruction Trace. A byte pointer to the handler that responds 
to an instruction trace trap. 

17 Arithmetic Exception. A byte pointer to the handler that 
responds to an arithmetic exception. The PSW contains bits 
which indicate the type of arithmetic exception (s) that 
occurred. 

18 Stack Pointer. A save area that maintains the stack pointer for 
cross ring call processing. 

19 Segment Entry Point. A byte pointer to the base of the gate 
array defined in the called ring. Each ring has a unique entry 
point and associated gate array. 

20 Breakpoint Trap. A byte pointer to the handler that is executed 
when the bkpt instruction is executed. 



5.3.2 Process Bags Q 



If a trap is classified as belonging to a user process, page of the 
current ring has the same format as specified above with one excep- 
tion: the first 16 words are reserved. 



5.4 Physical Address Space 

The physical address space is 2 Gigabytes. One Gigabyte is allocated to 
main memory (i.e., up to 1 billion bytes of physical memory can be config- 
ured), and 1 Gigabyte is allocated to I/O registers. Physical addresses 
through 3FFF FFFF (hex) reference main memory. Physical addresses 4000 
0000 (hex) through 7FFF FFFF (hex) reference 1/0 registers. Figure 5-3 dep- 
icts this partition. 
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Figure 5-3: Physical Address Space 
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Within the I/O space, one set of registers is presently defined. These 
registers contain the referenced and modified bits maintained for each 4096 
byte page of physical memory (a page frame) . The architectural limit of one 
Gigabyte of physical memory means that 2**15 bits must be maintained. The 
following allocation has been made: 



1 Addresses 4000 0000 to 4000 7FFF (hex) allocated to Referenced 
Bits . 

2 Addresses 4000 8000 to 4000 FFFF (hex) allocated to Modified 
Bits. 



All other I/O registers are reserved for future system use. It should be 
noted that 1/0 registers are not encached in a processor cache. This per- 
mits the I/O registers to change asynchronously without the processor hav- 
ing to concern itself with the presence of a cache. 

All operands within the 1/0 register space must be one byte. If a valid 
reference is made to an operand other than a byte, a process exception 
(class C (hex), qualifier 7 Invalid 1/0 access) occurs. See Table 6-1. 
The referenced and modified bits are accessed as byte operands. Thus, the 
load and store byte instructions should be used. These instructions load 
and store eight referenced or modified bits at a time. The use of any 
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other type of instructions will produce undefined actions. 

5.5 Translating Lagical Is. Ehysisai Addresses 



Presently, logical memory is substantially larger than physical memory. 
Consequently, a means must be provided to determine if there exists a phy- 
sical page or page frame for a valid logical address. This determination is 
accomplished by a two level pagetable mechanism. A two level pagetable is 
used for accessing data in memory in the same way that a two level file 
index might be used for accessing data on disk. 



5.5.1 Segment Descriptor Register 

A segment descriptor register is a 32 bit word aligned to word boundary, 
which controls the validity of a segment (the basic partition of the logi- 
cal memory space) and provides information relating to address translation. 
There are 8 SDR's, one for each segment. Each SDR is 32 bits long. An SDR 
has the following format: 



|V|0|Page frame Base | hw | sw 

3 3 2 

10 9 9 8 7 6 

Segment Descriptor Register 



The meaning of each of these bits is as follows: 

Bit<31> - Valid - If 0, this segment is not valid. A system exception is 
signaled and an error code is loaded into A5 after a context block is 
saved. (See Chapter 6) . 

Bit<30> - Hardware reserved. Must be zero. 

Bits<29..9> - Page frame base. The page frame base is the higher order 21 
bits of a 30-bit physical address. Bits <8..2> of this physical address 
come from bits <28..22> of the logical address to be translated. Bits 
<1..0> of the physical address are 0. This physical address references a 
page table entry in main memory. The page frame base is modulo 512 bytes. 
See note below. 

Bits<8..7> - Hardware Reserved. These bits presently have no meaning. Sys- 
tem software must not use these bits. 

Bits<6..0> - Software Reserved. 
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The page frame base in the SDR permits, if desired by the operating system, 
for one page (4 KB) to be used to contain the first level page table for 
multiple contiguous segments. The first level page table can be structured 
so that it is contained in a 512 byte page rather than in a 4096 byte page. 
This 512 byte page can be used in one of two ways. It can be used to con- 
serve physical memory by only allocating 512 bytes rather than 4096 to the 
first level lookup. Or, the 512 byte page can be configured to be one of 
the eight possible 512 byte partitions in a 4096 byte page. This last 
feature permits multiple first level lookups to be physically contained in 
one page frame. 



5.6 Eags labls Entry, (ete) 

A pagetable entry (PTE) is a 32-bit word aligned on an integral 32-bit 
boundary (the least significant two bits of the byte address are 00) . A PTE 
is one of 128 entries for the first index level or one of 1024 entries for 
the second index level. A PTE is used to determine the validity of a refer- 
ence and the physical memory location of a valid reference. A valid refer- 
ence meets two requirements: first, the PTE must be valid (bit 31=1), and 
second, the type of access being made (Read, Write, or Execute) must be 
allowed by the appropriate protection bit (bits <3. .1> of the PTE). The 
format of a valid, resident PTE is shown for first level and second level 
page table entries, below: 



| V | | Page frame Address |hw | sw |rd|wr |ex|nr | 

3 3 2 11 

109 21 87 43210 

First Level Page Table Entry 



I V | Page frame Address |hw | sw |rd | wr |ex |nr | 

3 3 11 

10 21 87 43210 

Second Level Page Table Entry 



The meaning of each of these bits is as follows: 

Bit<31> - Valid. Indicates the validity of the PTE. A indicates an 
invalid reference; a 1 indicates a valid reference. A segment out-of-bounds 
error is detected when an invalid PTE is accessed. A reference to an 
invalid PTE results in a system exception. See Table 6-1. 
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When bit <31> and bit <30> are both 1, bit<0> is ignored. A valid PTE that 
references I/O space is always assumed to be resident. 

Bits<30..12> - Page Frame Address. If a valid reference to a resident page 
occurs, then bits <30. .12> become the most significant 19 bits of a 31-bit 
physical byte address. The page frame base is modulo 4096 bytes. For a 
first level page table entry, bit 30 is always 0, while for a second level 
page table entry, bit 30 may be either or 1 . 

Bits<11..9> - Hardware Reserved. These bits are reserved for potential use 
by hardware. Presently, there is no interpretation of these bits. It is not 
recommended that these bits be used for software. 

Bit<8> - Encache. When bit 8 is zero, the data associated with the refer- 
ence are encached. When bit 8 is 1, the referenced data are NOT encached. 

Bits<7..4> - Software Reserved. These bits are reserved for potential use 
for software. 

Bit<3> - Read Access. Indicates the validity of a read access to the refer- 
enced page. A indicates that no read access is permitted. A 1 indicates 
that a read access is permitted to the referenced page. If a read access is 
attempted, and bit 3=0, a system exception is signaled, and an error code 
is loaded into A5. 

Bit<2> - Write Access. Indicates the validity of a write access to the 
referenced page. A indicates that no write access is permitted. A 1 indi- 
cates that a write access is permitted to the referenced page. If a write 
access is attempted, and bit 2=0, a system exception is signaled, and an 
error code is loaded into A5. 

Bit<l> - Execute Access. Indicates the validity of an execute access 
(branch or jump to instruction) to the referenced page. A indicates that 
no execute access is permitted. A 1 indicates that an execute access is 
permitted to the referenced page. If an execute access is attempted, and 
bit 1=0, a system exception is signaled, and an error code is loaded into 
A5. 

Bit<0> - Non-Resident. Indicates the presence or absence of the referenced 
page frame in the physical address space of the process. A indicates the 
absence of the referenced page in physical memory. In this case, a page 
fault occurs and causes a system exception. A 1 indicates the presence of 
the referenced page. In this latter case, bits<30..12> are used as the phy- 
sical page frame address of the referenced page. Bit is interpreted for 
valid references only. 

NOTE: Segment out-of-bounds errors may be detected by resetting all of the 
unused PTE's valid bits to zero. Thus, during logical to physical address 
translation for invalid pages, an out-of-bounds reference causes a system 
exception. 

The physical addresses of all pagetables must reside in physical memory 
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(physical addresses through 3FFF FFFF (hex)). Thus, all physical address 
of pagetable entries are 30 bits in length. 

The format of a valid non-resident PTE is: 



V | | Software Reserved | rd | wr | ex | nr 
31 30 4 3 2 10 



Note that bit <30> must be zero. If bit <30> is a 1, an I/O reference 
can occur. The read, write, and execute bits are then interpreted to 
determine if the reference is valid. 



5.6.1 Einal Translation 

This section presents a pictorial representation of the logical to physical 
translation. The following attributes of this translation are worth noting: 

1 The pagetable referenced by the first level index is always 
resident in physical memory. 

2 The pagetable referenced by the second level index may not be 
resident in physical memory. A page fault can occur when 
referencing a second level pagetable page. 

3 The access bits in the first level pagetable entry are never 
interpreted. That is, no protection access checks are performed 
when a first level pagetable entry is used to reference a second 
level pagetable entry. 

4 If a pagetable entry is invalid, no further translation occurs. 

5 A page fault occurs only for valid references. 

For logical to physical address translation purposes, a 32-bit byte logical 
address has the structure shown in Figure 5-2: 
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"igure 5-4: 32-bit Byte Address: Logical to Physical Translation 



Logical 
Address: 



SDR 


INDEX.1 


INDEX.2 


PAGE OFFSET 



31 



29 28 



A A 

22 21 



i a 

12 11 



SDR Select: use 3 highest order bits of logical 
address to select correct SDR 



Selected 
SDR 

Contents 



31, 



A 

30, 



A 

29 



A A A A 

9 8 7 6 



A 





V 





PAGE FRAME BASE 


HW 


SW 



A 
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2nd level PTE select: use bits <29. .12> of 1st level 
PTE and bits <21. .12> (INDEX.2) of logical address 
to determine address of 2nd level PTE 



Physical 
Address of 
2nd PTE: 
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Physical address determination: use bits <30. .12> 
of the 2nd level PTE and bits <1 1 . .0> (page offset) of 
the logical address to determine the physical 
address being accessed 
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5.7 



Es f er engsd/MadiXiad Bits 



Associated with each page frame are two flags 

The referenced bit is used to indicate that 

write, or execute access) has occurred (bit 

corresponding to that page frame) . The modified bit is used to indicate 

that a successful write has occurred (bit is set to 1) . A successful write 

also sets the referenced bit to 1. 



"referenced" and "modified", 
a successful reference (read, 
is set to 1 in the page 



For the purposes of memory management, a successful reference is defined as 
a memory reference which does not result in a PTE violation on a resident 
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page . 

I/O memory references do not affect the state of the referenced and modi- 
fied bits. When power is first applied, the state of the referenced and 
modified bits is indeterminate. 

The referenced and modified bits are mapped into the I/O space. Thus the 
operating system accesses the referenced and modified bits by mapping the 
appropriate I/O space into the physical address space of a process. 

In particular, the referenced and modified bits are grouped into bytes. 
Consequently, referenced and modified bits are addressed through the use of 
Load and Store byte instructions. References using any other instructions 
produce undefined results. 

5.8 Address Xranslatioji Unit 

The Address Translation Unit is used to accelerate the translation of logi- 
cal to physical addresses. This unit contains a cache of recently 
translated logical addresses. 

The steps necessary to translate a logical to physical address have already 
been described. Once a translation occurs, the association between the log- 
ical to physical addresses should be placed in memory for the following 
reasons : 

1 The steps necessary to translate the logical addresses require 
machine cycles that would otherwise be used to execute instruc- 
tions . 

2 Programs exhibit temporal and spatial locality of reference. 
Thus it is probable that once a logical to physical translation 
is accomplished and encached (remembered) , subsequent logical 
addresses will reference the same page associated with the ini- 
tial translation. 

Because previous translations can be placed in memory, the number of pro- 
cessor cycles allocated to logical address translations is significantly 
reduced, a feature which greatly enhances program performance, and makes 
the ATU an address cache. 

The ATU accelerates address translations by associating a logical address 
with an ATU entry. This ATU entry contains three types of information. One 
type is a page frame physical address. This address holds the contents of 
the PTE that referenced the addressed operand. A second type is the higher 
order bits of the logical address encached. Since the page-offset field of 
the logical address is not translated, this entry contains, at most, the 
most significant 20 bits of the translated logical address. The third type 
of information is concerned with the access privileges associated with the 
addressed page. The ATU entry provides a convenient place to store these 
privileges. Some characteristics of the ATU are relevant to the system 
programmer because : 
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1 The size and structure of the ATU are implementation dependent. 

2 Individual entries within the ATU are not explicitly address- 
able. 

3 Modification of a PTE in memory does not necessarily have an 
immediate effect, if any, on ATU entries. 

4 Several privileged mode instructions exist to permit a level of 
control over ATU address translation in a manner that is ATU 
implementation independent. These instructions purge the entire 
ATU or selective entries. Purging the entire ATU is necessary 
for process multiplexing. Purging selective ATU entries is used 
when selective PTE modifications occur (e.g., when an address 
translation fault finds the physical page in main memory but not 
in the physical space of the process) . 



5.9 Proces s 

A process may be defined as an abstraction of the locus of control that 
passes through an executing program. CONVEX processes are unique in con- 
struction and are composed of two general partitions: one partition is the 
user program; the other is that part of the operating system that is shared 
by all user processes. This user part of the operating system usually 
includes such features as pagetables used for translating logical to physi- 
cal addresses, buffers for disc or terminal records, and various control 
blocks that are created by the operating system on behalf of the user. 

CONVEX processes are unusual in that each has its own private logical 
address space. Thus logical addresses, though identical in two or more 
processes, need not translate to the same physical address. As a result of 
this logical address space structure, the ATU must be purged when a new 
user process is dispatched. Purging an ATU simply involves marking all 
entries as invalid, so that no encached translations exist. 

Owing to the characteristics of ring of the protection system, encached 
entries for ring translations need not be purged. This freedom is possi- 
ble because ring is system-wide (not process-wide) , which means that 
every process shares the same ring 0. Interrupt processing is an example of 
a system-wide service that is performed in ring 0. This partition is dep- 
icted in Figure 5-4. 
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Figure 5-5: Process/System/Segment Partition 
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5.10 EfiHsr Up/fisfltatraEi/Eh^sical Addressing 

When power is first applied and the system is bootstrapped, logical 
addresses equal physical addresses. There is no memory mapping or page 
faulting, and the least significant bits of the logical address space 
representing the physical address space are passed directly to the main 
memory system. 

An instruction exists to initiate virtual address mapping and the control 
of the Address Translation Unit. Additional instructions and/or facilities 
exist to turn off address translation. Generally the existence of the ATU 
is known only to the operating system kernel, and its internal structure is 
implementation dependent. All instructions which manipulate or control the 
ATU are privileged- -they can only be executed in ring 0. 



When logical addressing is disabled, 
always in privileged mode (ring 0) . 



the processor executes as if it were 
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CHAPTER 6 
6 Exceptions 



6.1 



An exception is an event which disrupts the running of a program, process, 
or system. Exceptions occur because of problems in the currently executing 
program (for example, arithmetic inconsistencies or address translation 
faults) , or as a result of some asynchronous event (such as an interrupt or 
hardware failure) . Exceptions result in the transfer of control to a 
predetermined address known as an exception handler. The starting 
addresses of the exception handlers are located in tables in memory 
referred to as Page 0. The definition of these tables is provided in 
Chapter 5. State information is saved on the appropriate stack. 



The primary goals for exception processing are: 

1 Wherever possible, the operating system kernel will not be 
involved. 

2 The hardware will structure exceptions as asynchronous kernel 
calls. This permits the OS kernel to use a single procedure for 
call processing. 

3 The hardware will provide a reasonable and open-ended means to 
indicate the cause of the exception. 

4 The hardware will provide a reasonable means to mask out those 
exceptions which are under user control . 



To achieve these goals, exceptions are grouped into three different 
classes: process, system, and machine. 

1 Process exceptions belong to the currently running process, and 
may be handled with an exception handler in that process. The 
exception handler is in the current ring of execution. 

2 System exceptions cannot be handled by the current process and 
require intervention by the kernel executing in ring 0. 

3 Machine exceptions include fatal errors in the system which can- 
not be handled by the operating system. 
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If exceptions of different classes are pending simultaneously, machine 
exceptions have the highest priority, followed by system exceptions. 

Exceptions may also be subdivided by the way in which they are normally 
treated by the exception handler. The exception handler will handle excep- 
tions in one of two ways depending on the nature of the exception. In many 
cases, the exception handler can correct the underlying cause of an excep- 
tion and permit the original program to resume; on the other hand, the 
exception handler may choose not to correct the cause of the exception and 
not to return control to the original program. Two terms signify the way in 
which the system treats each exception type : 

1 A fault is an exception which the handler can normally correct. 
The exception handler returns control to the program at the 
place which it was interrupted. 

2 A trap is an exception which the handler cannot normally 
correct. Often, the handler terminates the program or process 
and supplies error information to the user. 



6.2 Process Exceptions 



Process exceptions occur at the process level and the user can handle them 
without system intervention. The exception handler which is called resides 
in the current ring of execution. The process exceptions are arithmetic 
trap and instruction trace. Instruction trace, together with sequential 
execution and the breakpoint instruction (bkpt) , provides support for pro- 
gram debugging. In addition, the user can disable or mask out many of the 
process exceptions. 



An arithmetic trap occurs when an operation encounters or produces an ille- 
gal value, one which is not within the representable range of numbers for 
the machine. However, the user can mask out these exceptions using the 
appropriate enable bits provided in the PSW. Arithmetic traps are pro- 
cessed thus : 

1 The processor sets the appropriate bits within the PSW to 1 to 
indicate the cause of the trap. Since a CONVEX processor has 
multiple arithmetic units, it can set more than one bit in the 
saved PSW. Sufficient bits exist to identify multiple trap 
types simultaneously. 

2 The processor pushes an extended return block onto the current 
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stack (no ring crossing occurs) . 

3 The processor clears PSW bits (C, SC, AIV, ADZ, VN, OV, FDZ, RO, 
SIV, SDZ, FRL) of the newly-generated PSW to 0. 

4 Instruction execution for the trap handler begins at the address 
contained at byte address 44 (hex) of page of the current 
ring. 

5 The machine initiates the trap as soon as steps 1 through 4 have 
occurred, unless an exception of higher priority is also pend- 
ing. 

/ 



The following PSW bits report the occurrence of exceptions AIV, ADZ, SIV, 
SDZ, UN, 0V, RO, and FDZ. The PSW bits IVE, DZE, FE, and FUE selectively 
enable groups of arithmetic exceptions. The user may choose to ignore cer- 
tain exceptions by clearing the appropriate enable bit to 0. The 
IVE/Integer Overflow Trap Enable bit corresponds to the SIV and AIV bits; 
the DZE/Divide by Zero Enable bit corresponds to the ADZ and SDZ bits; the 
FE/Floating Point Trap Enable bit corresponds to the 0V, RO, and FDZ bits, 
and finally, the FUE/Floating Point Underflow Enable bit corresponds to the 
UN bit. 



Because of the pipelined nature of the machine, more than one instruction 
may be executing when a trap occurs. This sequence is: 

1 When the machine detects an arithmetic exception that requires a 
trap, it places all pending instructions on hold. 

2 The system allows all current instructions to complete their 
execution . 

3 The system honors the exception only after completing steps 1 
and 2, and only if there are no events pending with a higher 
priority (such as interrupts) . 



6.2.1.1 Details 0_f Arithmetic. Traps 



This section details the characteristics of each type of arithmetic excep- 
tion. 



Integer Overflow. Integer overflow occurs when a result is too large to 
occupy the specified destination. When an integer overflow occurs, the AIV 
or SIV bit in the PSW is set to 1. The result loaded into the destination 
is correct in the least significant bits. When an integer overflow 

Section 6.2.1 ..6-3 



Exceptions 

exception occurs for integer longword multiplication (64 bits) , the result 
is correct in the least significant 53 bits. Bits<63..53> are undefined. 

Integer Divide By Zero. When the divisor is zero, the processor sets the 
appropriate divide by zero bit in the PSW (ADZ or SDZ) to 1. The output of 
the divide is the dividend. 

Floating Divide By Zero. When the divisor is zero, the processor sets the 
FDZ bit to 1. The output of the divide is a reserved operand. 

Floating Point Overflow. When the resulting exponent requires more preci- 
sion than is allowed (greater than 127 (unbiased) for single and greater 
than 1023 for double), a floating point overflow occurs. The result 
operand is forced to a reserved operand (sign=l, exponent and fraction all 
0*s) . The OV bit in the PSW is set. 

Floating Point Underflow. When the resulting exponent requires an exponent 
less than -127 (unbiased) for single precision and less than -1023 for dou- 
ble precision, a floating point underflow occurs. The resulting operand is 
forced to true zero (sign =0, exponent=0, fraction=0) . True zero is forced 
regardless of the value of the underflow trap enable bit. The machine sets 
the UN bit in the PSW. 

Reserved Operand. When an input to a floating point arithmetic operation 
has a sign=l and an exponent of all 0, a reserved operand exception is 
detected. The fraction value is a "don't care." The output of an arithmetic 
operation with a reserved operand input is a reserved operand output. A 
reserved operand output has an all 0's fraction. The R0 bit in the PSW is 
set. 



6.2.2 Debugging Suppari 

The remainder of the process exceptions are useful debugging tools. 
Instruction trace allows a single instruction to execute between each 
exception, the sequential bit (SEQ) in the PSW forces instructions to exe- 
cute one at a time without overlap, and a breakpoint instruction (bkpt) 
causes transfer of control when it is executed. 

6.2.2.1 Instruction lease 

Instruction trace is a useful debugging tool, one which the user can 
directly control. Setting bit 27 of the PSW (TR) enables instruction trace 
and causes a trace trap to occur. After the execution of each instruction, 
the processor pushes an extended return block onto the stack. The Program 
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Counter pushed references the next instruction to be executed in the pro- 
gram. The exception handler is located at the address contained at address 
40 (hex) of the current ring. Since no ring crossing occurs, the process- 
ing of this trap does not involve the operating system. For instruction 
trace to function properly, bit 24 of the PSW, SEQ, must also be set to 1. 



6.2.3 Ssgusntial 



Although sequential execution is not an exception, its value affects the 
operation of the machine and perhaps the specific conditions which exist 
when an exception occurs. As noted in Chapter 3, when a program sets bit 
24, SEQ, of the PSW, all overlapped execution is disabled. Owing to the 
pipelined characteristics of the machine, multiple instructions are often 
executing simultaneously. The SEQ bit forces execution in a serial manner 
for debugging purposes (both for hardware and software) . The numerical 
results produced are the same regardless of the setting of the SEQ bit. 
The procedure only affects performance and the serial nature of the execu- 
tion. The user may freely set or reset this bit. 

6.2.3.1 Breakpoint 



Although the breakpoint instruction (bkpt) is also not a true exception, it 
is included here since it qualifies as a debugging tool. The user may 
insert bkpt instructions anywhere within a program, and execution of the 
bkpt instruction causes a call to the routine addressed by byte 50 (hex) of 
the current ring, and pushes an extended return block on the stack. 



6.3 System Exceptions 



All system exceptions result in a ring crossing to Ring 0, the operating 
system kernel . The return block saved in each case is either extended 
(FRL=01) or context (FRL=00) . Many of the system exceptions are related to 
virtual memory address translation, as described in Chapter 5. Chapter 6 
describes the processing of system exceptions. 



All system exceptions have the following characteristics: 

1 They are not maskable. 

2 They always result in a cross ring call to ring 0. 

3 There are residency and alignment requirements for Page 0. The 
ring stack must always be aligned on a 32-bit (word) boundary, 
and ring page must be resident. If not, a machine exception 
results . 
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6.3.1 Erxsx Exii Trap. 

An error exit trap occurs if the processor encounters an all-zero opcode. 
If the processor attempts to execute code from memory which resides beyond 
the boundaries of a program, this trap will occur, assuming that the memory 
in question has previously been cleared to zero. 

6.3.2 Undefined Opcode Tray 

An undefined opcode trap is executed whenever the processor attempts an 
illegal instruction. 

6.3.3 Vector Yalid Eaulfi 

The vector valid fault, coupled with the Vector Valid bit (W) , permits the 
operating system to save and restore the vector accumulators on demand. 
Additionally, it permits the OS to detect unsuspected use of vector 
instructions. The example below details the use of this fault. 

Assume that 10 programs are running but that only 2 use the vector accumu- 
lators. Upon interrupt processing, during one of these 2 programs, the 
system need not save the vector accumulators since the interrupt service 
routine does not use them. If subsequent programs do not use the vector 
accumulators (either statically because there is no need, or dynamically 
because the particular code segment is not vector in nature) , no time is 
wasted in saving vector machine state. This enhances the real-time charac- 
teristics of the system. However, if one of these subsequent programs 
correctly uses the vector machine state, a recoverable fault occurs. This 
fault indicates to the operating system that it is time to save a previous 
process's vector machine state. Once this state is saved, the affected 
program can resume. 

The vector valid fault functions in this sequence: 

1 The Vector Valid (W) bit must be a 0. 

2 The processor attempts execution of a vector instruction. 

3 The machine now performs a ring crossing to ring 0, and pushes 
an extended return block on the ring stack. 

4 Finally, the machine jumps to the instruction pointed to by 
address 1C (hex) of page of ring 0. 
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6.3.4 Eina Violation Traps 



Ring violation traps are a group of system exceptions concerning invalid 
access to rings. The operation of the ring structure is defined in Chapter 
4. The following ring violations are defined: 

1 Privileged Instruction. This trap occurs when the system 
attempts a privileged instruction outside of ring 0. 

2 Inward Address. A reference to an address which is in an inner 
ring causes this trap. 

3 Outward Call. A call which crosses rings must proceed to an 
inner ring; otherwise this trap will occur. 

4 Inward Return. This trap occurs when a return instruction 
attempts to move to an inward ring; all returns must be to the 
same or an outward ring. 

5 Invalid Gate. If the gate number specified in a call which 
crosses rings is incorrect, this trap will occur. 



6.3.5 ET£ Violation Traps 

PTE violation traps encompass a group of illegal Page Table Entry accesses. 
Pagetables and their usage are defined in Chapter 5. PTE violations 
include: 



1 Read Protect. This trap is invoked when the processor attempts 
a read access to a page whose valid Page Table Entry does not 
allow reads. 

2 Write Protect. A write protect trap occurs during an attempted 
write to a page whose valid PTE does not have write enabled. 

3 Execute Protect. This trap occurs when the processor attempts 
an instruction fetch on a page without execute enabled in its 
valid PTE. 

4 Invalid SDR. A memory access to a segment whose SDR's valid bit 
is not set causes this trap to occur. 

5 Invalid Level 1 PTE. A memory reference to an address whose 
first level PTE's valid bit is not set results in this trap. 

6 Invalid Level 2 PTE. If an address's corresponding Level 2 PTE 
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is not valid, an invalid level 2 PTE trap occurs. 
7 Invalid I/O Access. 

6.3.6 Non- resident Eage Eaults 



A non-resident page fault occurs when the processor attempts to reference a 
memory location which is part of the logical address space but is not part 
of the physical address space. The system initiates a page fault only 
after it has interpreted the validity and appropriate access bits in a Page 
Table Entry. The two forms of this fault are: 



1 Non-resident data page. If the actual data page corresponding 
to the logical address is not in physical memory, this fault 
occurs . 

2 Non-resident Level 2 pagetable. A page fault can also occur as 
a result of a logical reference which, as part of its logical 
address translation, accesses a non-resident pagetable. 



Note: if the system detects another page fault while the processor is 
responding to a page fault as described in the above sequence, a machine 
exception occurs. This check prevents the generation of an infinite number 
of page faults. 



6.3.7 BtQfigssing s_f System Exceptiens 



When the machine detects a system exception, it accesses the exception 
handler by one of two methods. First, address 1C (hex) of page of ring 
points to the vector valid fault's exception handler. Second, a single 
exception handler serves all other system exceptions; address C (hex) of 
page of ring contains its address. Information passed in registers A3, 
A4, and AS describes these exceptions. As soon as the System pushes a 
return block on the current ring stack, it loads an exception code into 
A5. Byte and 1 of this code are always 0. Byte 2 specifies the class of 
the exception, and Byte 3 is an optional qualifier for that particular 
class. 



In addition to the codes loaded into A5, the processor loads the logical 
address of the failure into A4, and loads the number of bytes stored in the 
return block into A3 if a context block is saved (FRL=00) . In fact, the 
FRL field of the PSW pushed specifies the type of return block: short, 
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long, extended, or context. Whereas the number of bytes for short, long, 
and extended are invariant for the life of the architecture, the context 
bloGk is implementation dependent. 



Table 6-1 lists the class codes and qualifiers placed in A5 for each excep- 
tion. 



Table 6-1: System Exceptions: Class Codes and Qualifiers 



EXCEE1IQN 
Error Exit 
Undefined Opcode 
Ring Violation 



PTE Violation 



Non-Resident Page 



BYTE 2 

CLASS (Hex) 

Highest Priority 

4 

8 



10 Lowest Priority 



BYTE 3 

QIZALIEIERS 

None 

None 

Privileged Instruction 

1 Inward Address 

2 Outward Call 

3 Inward Return 

4 Invalid Gate 

5 Invalid Frame Length 
on Return Instruction 



Read Protect 

1 Write Protect 
Execute Protect 
Invalid SDR 
Invalid Level 1 
Invalid Level 2 



PTE 
PTE 



Invalid I/O Access 



Level 1 PTE2 Page 

1 Level 2 DATA Page 



6.4 Machine Exceptions 



Machine exceptions comprise the most drastic of exceptions. Hardware 

failures, such as memory errors, which cannot be corrected, and parity 

errors cause machine exceptions. In addition, the following programmer 
visible conditions result in machine exceptions: 
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1 Unaligned Ring stack. The stacks in ring must always be 
aligned on a 32-bit boundary. If not, a machine exception 
occurs . 

2 Page fault during a page fault. If either a PTE violation trap 
or a non-resident page fault occurs while the machine is chang- 
ing context to service one of these two exceptions, a machine 
exception occurs. If this condition were not detected, an 
infinite number of page faults would occur. 

3 Non-resident data for segment descriptor registers. If a non- 
resident page fault occurs for the data read by either the load 
kernel segment descriptor registers (ldkdr) or load process seg- 
ment descriptor registers (ldsdr) , a machine exception results. 

4 Unaligned data for segment descriptor registers. If the data to 
be loaded by either an ldkdr or an ldsdr are not word aligned on 
a 32-bit boundary, a machine exception also occurs. 

5 Execution of ldkdr after virtual memory is enabled. The execu- 
tion of an ldkdr (load kernel segment descriptor registers) 
instruction after virtual memory has been enabled causes a 
machine exception. 

6 Invalid SDRO after virtual memory is enabled. If SDRO is 
accessed while its valid bit is cleared, the resulting exception 
cannot be processed and a machine exception results. 



The processing of machine exceptions is implementation dependent. In gen- 
eral, however, further processing is not possible. The machine might post 
a message to the console or to an error logging device, might alert a diag- 
nostic processor (if one exists), or might simply halt the machine. The 
details of machine exception processing are presented in Ths Har dware Handz. 
haals. 
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CHAPTER 7 

7 I/Q and In£s 



All I/O is memory mapped, which means that there are no explicit central 
processing unit (CPU) instructions that reference I/O control or data 
registers. I/O registers and status bits are referenced through an 
appropriate logical to physical address mapping. The I/O register space is 
1 billion bytes. In essence, up to 1 billion I/O registers can be refer- 
enced. As a function of the implementation, however, certain types of 
operand references may cause undesirable side effects. Generally, I/O 
operand references should be on an integral byte boundary, such that the 
least significant address bits equal to the precision of the referenced 
operand will be all O's. 

Interrupts are a result of asynchronously occurring events and belong to 
the system and not to the executing process. They are processed on an 
interrupt stack in ring 0. Because interrupts can occur during interrupt 
processing, a means to nest them must be provided. When an interrupt 
occurs, the processor will vector to a particular handler as a function of 
the source of the interrupt. 



7.1 JE and I/O. In£exn«2£ Channels 

There are two types of channels: central processing unit virtual channels 
and I/O channels. There are 256 interrupt channels in a CONVEX system. 
Within these 256 channels, 8 channels are specifically allocated to the 
central processing unit (CPU) . These 8 channels are referred to as CPU 
virtual channels 0-7; they are also addressed as channels 0-7 of the 256 
possible system-wide channels. The remaining 248 channels are allocated to 
I/O processors. The number of I/O processors and the number of channels 
allocated to I/O processors are a function of a particular implementation. 
Instructions exist for any one channel to interrupt another channel . 

All external devices and controllers, regardless of their local intelli- 
gence, interrupt the CPU on one of eight channel ports. These eight ports 
bear no relationship to the number of actual I/O channels. In fact, one 
I/O channel may initiate interrupts using more than one CPU virtual chan- 
nel. Conversely, there may be up to 248 I/O channels competing for eight 
CPU virtual channels. The "mski" instruction is used by the CPU to mask out 
interrupts selectively from a particular CPU virtual channel. 

There are up to 248 I/O virtual channels. The CPU can interrupt, individu- 
ally, any of the I/O channels through the use of the "xmti" instruction. In 
some cases, one physical I/O controller may be viewed as multiple I/O chan- 
nels. The CPU can interrupt itself by addressing channels through 7. 
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7.2 Interrupt Mechanism 

When an interrupt (e.g. I/O, or Internal Timer) occurs, the following 
actions take place. The 16 bit halfword located at bytes 4 and 5 of page 
of ring is fetched. If this halfword is 0, then the interrupt is the 
first interrupt processed. This condition is referred to as base level 
interrupt processing. If this halfword is not the first interrupt, then the 
interrupt is not the first interrupt, and the processor is already at 
interrupt level. (Thus the ring stack used is the interrupt stack. In 
effect, the ring process stack pointer has temporarily become the inter- 
rupt stack pointer) . 

The fundamental difference between the two classifications is the existence 
of an interrupt stack. When the interrupt level is 0, an interrupt stack 
must be established in ring 0. Once the determination is made, the inter- 
rupted level's halfword is incremented by 1 and stored back into bytes 4 
and 5 (the increment by 1 cannot be interrupted) . 

In the following discussion, the program counter which is pushed onto the 
stack references the instruction that would have been executed if the 
interrupt had not occurred. Also, during the sequences described, all 
further interrupts are kept pending (ION is reset to 0) . 

If the interrupt is initiated by an I/O device interrupting a CPU 

virtual channel, then the CPU virtual channel interrupt is reset after the 
processor responds to the interrupt. 

7.2.1 Ring Q Stack Alignment 

The ring stack must always be aligned on a 32-bit (word) boundary. If it 
is not, a machine exception occurs. 

7.2.2 Base-Leyfil Processing 

Base- level processing occurs when the current interrupt level is 0. The 
actions that subsequently occur are determined by the current ring of exe- 
cution, be it ring or any other ring. 

7.2.3 Base-Leyfil Ring Q. 

It is assumed that the stack pointer is already initialized to the ring 
address space. Since the interrupt is at base- level, stack multiplexing to 
the interrupt stack must occur, and the following sequences are initiated: 

1 FRL is set to 01. (Extended Return Block). 

2 The extended return block is saved on the current ring stack. 
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3 The PSW is cleared. 

4 The updated stack pointer is saved in byte address 2C (hex) of 
page 0, of ring 0. This value is the process stack pointer at 
the top of the ring process stack. This procedure is prepara- 
tory to stack multiplexing to the interrupt stack. 

5 The stack pointer (A0) and frame pointer (A7) are loaded from 
byte address 20 (hex) of page 0, of ring 0. This is the inter- 
rupt stack pointer . 

6 A common hardware interrupt sequence is then executed. 



When a return to base-level is performed, the interrupt dismissal routine 
moves the previous process SP (in byte address 2C (hex) of ring 0) to A0 
prior to executing the rtn instruction. 

7.2.3.1 Base-Lsyai Eracssslng — Nan Sing Q 

In this case, the hardware performs a crossing to ring and establishes an 
interrupt stack. 

1 A ring crossing to ring is executed (as if the sysc instruc- 
tion were executed) . 

2 The steps described in the above section, for Base Level ring 0, 
are now executed. 



7.2.4 Interrupt 

At interrupt level, the ring stack has already been initialized to the 
interrupt stack. 

7.2.4.1 Ring Q~ - Interrup t Lgye_l 

An extended return block is pushed onto the current stack and the common 
interrupt sequence is then entered. 

7.2.4.2 Hon-Sing 0-- Interrup t LfiMfil 

In this case, the following actions are taken: 

1 A ring crossing to ring is executed. 

2 Since the ring stack has already been initialized to the 
interrupt stack, an extended return block is pushed on the ring 
stack, and the common interrupt sequence entered. 
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7.3 Cflmmsm interrupt Sequence 

The following section describes the actions undertaken after a crossing to 
ring and the establishment of an interrupt stack have occurred. There are 
two causes of an interrupt: an I/O device via a virtual channel (byte 
address 08 (hex) ) , and an Interval Timer (byte address 14 (hex) ) . 

1 Byte addresses 08, 10, and 14 (all hex) of ring contain the 
address of the appropriate interrupt handler. This address, 
selected by hardware, is loaded into the program counter. If 
the interrupt is caused by an I/O device, the identification of 
the interrupting device is loaded into A5 after the return block 
is pushed. This identification takes the form of 29 bits fol- 
lowed by a 3-bit encoding. The 3-bit encoding identifies which 
CPU virtual channel initiated the interrupt. 

2 The first instruction of the interrupt handler is now executed. 
Interrupts are not enabled. The interrupt handler must expli- 
citly reenable interrupts. 



7.4 General Naias 

1 The interrupt return sequence determines whether or not the 
return is to base- level or interrupt- level as a function of the 
interrupt halfword in ring 0, page 0. 

2 The return to base- level is achieved by executing a rtn instruc- 
tion. The return to interrupt level is also achieved by execut- 
ing an rtn instruction. 

3 In order to return from an interrupt, the following steps must 
be taken by the software: 

a. First, the interrupt level is decremented by one. 

b. If the level is now zero, A7 (FP) is loaded from byte address 
2C (hex) of page 0, and the rtn instruction is executed. 

c. If the level is not zero, the rtn instruction is executed. 
A7 need not be restored, since the ring stack must still be 
the interrupt stack. 

4 The process stack pointer in page zero, of ring bytes <72..75> 
is not modified during the hardware initiated interrupt process- 
ing. 
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CHAPTER 8 
3 Inatructiao Set Overview. 



This chapter and subsequent chapters describe the instruction set. The 
instruction set is used to generate logical addresses, load, store, and 
manipulate operands, and manipulate the virtual machine mechanisms. 



8.1 

A CONVEX instruction is one of three lengths: one, two, or three halfwords. 
This is equivalent to instructions that are 16, 32, or 48 bits in length. 
Even though the fundamental unit of addressability is the byte, instruc- 
tions are addressed on a half word boundary. All instructions begin on even 
byte boundaries. Thus bit of the Program Counter (PC) is never inter- 
preted . 

The instruction set was designed to meet high standards and can be charac- 
terized as follows: 

1 The instruction set is simple and easy to understand and 
decode- -hence the adoption of a RISC (Reduced Instruction Set 
Architecture) . 

2 All manipulations are generally register to register. Loads and 
stores are needed to transfer information to and from the regis- 
ter set . 

3 Orthogonality of instructions is assured; for every data type 
manipulation, there exists the same operation. (In some cases, 
this makes no sense, and thus there are exceptions.) 



8.2 Inaiructioja Eoraats 

There are 8 instruction formats. The first 8 bits of the instruction 
(<15..8>) are encoded using a modified leading l*s (HUFFMAN) encoding 
method to indicate the instruction format. The 8 instruction formats are: 
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FORMAT 

1, [op=6] / [Ri], [Rj], [Rk] 

1 00, [op=6] , [@,L] , [Ak] , [Rj] [disp=16/32] 

2 010, [op=7] , [Rj] , [Rk] 

3 0110, [op=6], [Rj], [Rk] 

4 0111 O, [op=3] , [disp=8] 

5 0111 10, [op=4] , [Rj] , [Rk] 

6 0111 110, [op=6] , [Rk] 

7 0111 1110, [op=5] , [Rk] 

Note that R is either A, S, or V as a function of the particular opcode. 

8.3 Addressing Mfldss 

There are two generic types of addressing modes: register to register and 
register to/ from memory. 

In the register to register mode, it is assumed that all source operands to 
be manipulated have been pre-loaded into one of the various machine regis- 
ters. The destination of the result is also a register. Register to regis- 
ter instructions specify none, one, two, or three unique registers. Thus 
some instructions can have up to 3 3-bit register designation fields. The 
opcode specifies the number of register fields in the remaining bits of the 
instruction. All register to register instructions are 16-bits in length. 

8.3.1 Seferencj.ng Memory 

This section describes how to build Effective Addresses. Generally, 
instructions which reference memory to load and store operands are either 
32-bits or 48-bits in length. The difference is the length of the displace- 
ment field. The structure of these memory reference instructions is: 
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L=0 | Opcode |@|L| Aj | Rk | | Displacement | 
15 8, 7,6,5 3,2 15 



L=l | Opcode |@|L| Aj | Rk | | Displacement | 

15 8,7,6,5 3,2 O 31 O 

MEMORY LOAD AND STORE FORMATS 

The meaning of these fields is as follows: 

OPCODE = bits<15..8>. Specifies the operation to be performed on the refer- 
enced data. 

@ = bit<7> - Indirection. Specifies the existence or absence of indirec- 
tion. If @=0, no indirection is specified. If @=1, indirection is speci- 
fied. 

L = bit<6> - Length. Specifies the length of the displacement field. If 
L=0, the displacement field is a 2's complement 16-bit integer. This field 
is sign extended to 32 bits prior to being added to the contents of the 
address register specified in bits <5..3> of the opcode halfword of the 
instruction . 

Rk = bits<2..0>. Specifies the source or destination of the referenced 
operand. The opcode specifies the precision, datatype, structure, and 
direction of the move. Rk can either be a scalar register (Sk) , an address 
register (Ak) , or a vector register (Vk) . 

Aj = bits<5..3>. Specifies the A (address) register to be used to generate 
the logical address. If AO is specified, the value of is used as the con- 
tents of AO for absolute addressing. The true contents of AO are unused. If 
Al-7 is specified, then the contents (all 32 bits) of the specified address 
register are added to the displacement field to generate the final effec- 
tive address of the target address. The first byte of the target operand 
is referenced. If indirection is specified, the effective target address 
references byte of a 32-bit indirect word. 

Displacement. A 16 bit or 32 bit value that is algebraically added to the 
entire contents of an A register (A1-A7) or used directly as a byte 
address. A 16 bit displacement is sign extended to 32 bits before it is 
used. 
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8.3.2 Indirection 



Indirection is the means by which an address in logical memory can be used 
to reference the ultimate target operand. Indirection occurs after indexing 
(adding of the address register). The format of an indirect word is: 



I Byte Pointer 
31 



Only one level of indirection can be specified. 

8.3.3 Branches 

Some forms of transfers of control are Program Counter relative. In these 
cases, no indirection or indexing is specified. There is a special form of 
transfer which permits an 8-bit signed displacement to be specified within 
a 16-bit (halfword) instruction. This is covered in more detail in Chapter 
11, "Program Control Instruction Set." 

8.4 Undef j Ln^d Opcode s 

When an attempt is made to execute an undefined opcode, a system exception 
occurs. An undefined opcode is a syntactically correct instruction whose 
opcode field has no associated definition. 

An undefined opcode results in a system call through byte address OC (hex) 
of page of ring (the system exception handler) . The class code loaded 
into byte 2 of AS after a push of the return block is 1. No qualifier code 
is loaded into byte 3 of A5. 

8.5 Eorjn 0_f Presentation 

The following chapters contain the definition of the CONVEX instruction 
set. 

Some of the conventions used throughout the instruction set definition are: 

Data types are specified with the following suffixes appended to the 
opcode : 

b - Byte/8 bits integer 

h - Ha If word/16 bits integer 

w - Word/32 bits integer 

1 - Long/64 bits integer 

s - Single Precision Floating Point/32 bits 

d - Double Precision Floating Point/64 bits 
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t - True 
f - False 



Other notation used includes: 

VS - Vector Stride 

VL - Vector Length 

VM - Vector Merge 

#n - a 3 bit immediate with value 0,1,.., 7 

IN - a 32 bit 2's complement signed immediate 



In the instruction set definition, two attributes are the alteration of the 
PSW and the detection of exceptions. If these fields in the instruction 
set definition are blank, then there is no change to the PSW and no excep- 
tions are detected. 



8.5.1 Mafca-nflfcatiflP. for. Instruction Syntax 



The following meta-notation is used to describe the contents of memory in 
the assembly language syntax: 



1 Effective Address means the effective memory address of an 
operand . 

2 c (effective address) = Sk means that the contents of the Sk 
register are stored into the memory location specified by the 
effective address. 

3 Sk = c (effective address) means that the contents of the memory 
location specified by the effective address are loaded into Sk. 

4 | = means alternation. Thus (a|b) means a or b. 

5 ! = is used as a comment delimiter. All text to the right of 
the ; is an English language comment relative to the 
metalanguage on that line. 



8.5.2 Instruction Rags Layout 

The generic format and layout used in this handbook for an instruction are 
shown in Figure 8-1. 
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Figure 8-1: Instruction Page Layout 
The Name of the instruction (s) 



The assembler syntax 



Purpose : 



"The purpose or intent of the instruction" 



Format: 



The physical format of the instruction, including 
field locations and use. 

Operation: 

The FORTRAN metalanguage description of the instruction. 

PSW: 

The listing of alterations to the PSW, if any. If there is none, 
this space is left blank. 

Exceptions : 

The listing of exceptions that are detected. If a trap is enabled, 
a trap occurs. Please note that for all instructions which refer- 
ence memory, exceptions related to address translation (such as 
page faults or protection violations) can occur. 



Opcode: 



A listing of the assembly language syntax, binary opcode, and 
opcode name. One page may contain the definition of many orthogo- 
nal instructions. The binary shown for each opcode is to be placed 
in the opcode field of the instruction format. Additional fixed 
subfields within the instruction are shown in the above format sec- 
tion. The opcode is presented in three columns. The first column 
is the opcode name; the second is the binary encoding of the 
opcode, and the third is the opcode function. For example, for the 
following instruction, 
add.w Aj,Ak 0101100001 Add addr. reg. word 



add.w Aj,Ak is the opcode name and the assembly language syntax, 
and 0101100001 is the binary encoding of the opcode. Add address 
registers is a brief description of the opcode function. 

Description : 

An English description of the functions performed by the instruc- 
tion. 



Notes : 



A listing of notes that may be of interest to the understanding and 
use of this or other appropriate instructions. 
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CHAPTER 9 

9 Address Register Instruction Set 

This chapter describes the instructions which manipulate the Address regis- 
ters: 

1 Loads and Stores 

2 Arithmetics 

3 Logical Operations 

4 Shi ft/Push/P op/Mo ve/PSW/Ef fective Address 

5 Compares 

6 A Register Conversions 



9 . l Overview. 

The instructions defined in this chapter manipulate operands in the A 
registers. When these operands are less than 32 bits, only the specified 
bits of the A registers are modified in a known way. All other bits are 
left unchanged. 

Instructions are provided which load and store bytes, half words, and words. 
Arithmetic instructions are provided to: add, subtract, multiply, and 
divide halfwords and words. All arithmetics are performed using 2's comple- 
ment manipulations. Overflow is generated and stored in the PSW. Integer 
traps can be disabled. Logical operations operate on words only. A full 
set of signed and unsigned compares is provided. These compares set or 
reset the carry flag. 



9 . 2 Leads and s tores 

These instructions describe the means by which operands are loaded and 
stored to/ from address registers . These instructions include Load Address 
Register, Store Address Register, and Load Address/ Immediate. Load and 
Store work with byte, halfword, and word operands, and Load Immediate with 
halfword and word. These instructions do not affect the flags in the Pro- 
gram Status Word. 



9.3 

These instructions perform 2's complement arithmetics on the contents of 
the specified address registers. Included are instructions for Add, Sub- 
tract, Multiply, Divide, and Negate Address/Address, and the following 
Immediates: Add Address/Immediate, Subtract Address/Immediate, Multiply 
Address/Immediate, and Divide Address/Immediate. Add Scalar Address is 
also included here. All Arithmetic instructions are either halfword or 
word and affect the flags in the Program Status Word as follows: 
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Arithmetic Overflow is signified by the AIV bit in the PSW, the C bit 
reflects a carry or borrow out of the most significant bit of the opera- 
tion, and the ADZ bit is set on the occurrence of a divide by zero. 

9.4 Logical Qpsxaticns 

Logical operations perform a bitwise operation between two address regis- 
ters. Four sets of logicals are provided (AND, OR, XOR, Complement). All 
32 bits of Ak and Aj participate in the logical operation. The instruc- 
tions are AND Address/Address, OR Address/Address , Exclusive OR 
Address/Address, and Complement Address/Address, and the following Logical 
Immediates: AND Address/Immediate, OR Address/Immediate, and Exclusive OR 
Address/Immediate . 

9.5 Sbift/Euah/Eflp/Msyfi/ESH/Llfectiys Address 

These instructions include Logical Shift Address/Address, Logical Shift 
Address/Immediate, Load Effective Address, Push Effective Address, Move 
PC/Address, Move Address/Address, Push Address Register, Pop Address Regis- 
ter, Test and Set, Load Physical, Move PSW/Address, and Move Address/PSW. 
Move PC/Address, Push, and Pop Address Registers, Move PSW/Address and Move 
Address/PSW all use word operands; Test and Set operates on a byte 
operand . 



9.6 Compares 

Address register comparisons result in the setting or clearing of the carry 
bit (bit 31 of the PSW) . If the specified comparison is true, the carry bit 
is set to 1. If the comparison is false, the carry bit is cleared to 0. 
There are 2 sets of comparisons: signed and unsigned. Unsigned comparisons 
treat both operands as positive values. For each set of comparisons, an 
instruction group is provided which permits the specification of an immedi- 
ate. One form of the immediates permits a register within a 16 bit 
instruction to compare with the values: 0,1,..., 7. This is particularly 
useful in, comparing with 0. 

Typically, after the execution of one of these compare instructions, a 
branch on carry instruction is executed. The strategy adopted for compares 
and branches is as follows: provide a compare register-to-register instruc- 
tion for one of three of the possible six compare relations. If the 
required relation is not one of the three provided, then the complement of 
the relation is used along with the complement of the branch condition. 

The following are the complementary relations used for compares: 

•LE. <-> .GT. 
•LT. <-> .GE. 
.NE. <-> .EQ. 
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For example A .LE. B , BRANCH TRUE is equivalent to A .GT. B, BRANCH FALSE. 
Coincidental ly, for many operators A .REL. B is identical to B .not. .REL. 
A. Thus A .LE. B, BRANCH TRUE is equivalent to B .GE. A, BRANCH TRUE. 

The Compare instructions include Compare Address/Address, Compare 
Address/Address Unsigned, Compare Address/Immediate, and Compare 
Address/Immediate Unsigned. These operands are all half-word and word 
instructions, and affect the C flag as the result of a compare. 



9-7 A Register CanYersians 

This instruction- -Convert Integer Address/Address--converts the various A 
register integer operands from one precision to another and affects the AIV 
flag in integer overflow. 
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LOAD ADDRESS REGISTER 



Id. (b|h|w) <effa>,Ak 



Purpose: 



To load memory operands into the address registers. 



Format: 



| Opcode |@|L| Aj | Ak | 
15 8,7,6,5 3,2 



| Displacement | 
(31,15) 



Operation: 

Ak = c (Effective Address) 

PSW: 

Exceptions : 

Opcode : 

ld.b <effa>,Ak 001010000 

ld.h <effa>,Ak 001010010 

ld.w <effa>,Ak 001010100 



Load addr. reg. byte 
Load addr. reg. halfword 
Load addr. reg. word 



Description : 

The operand referenced by the effective address is loaded into the 
address register Ak. 



Notes i 



Byte operands are loaded into bits<7. .0> of the specified A regis- 
ter. Halfword operands are loaded into bits<15..0> of the speci- 
fied A register. All other bits are left unchanged. 
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STORE ADDRESS REGISTER 



st.(b|h|w) Ak,<effa> 



Purpose : 



To store the contents of an address into memory. 



Format: 



| Opcode |@|L| Aj | Ak | 
IS 8,7,6,5 3,2 O 



Operation: 

c (Effective Address) = Ak 

PSW: 

Exceptions : 

Opcode : 

st.b Ak,<effa> 001011000 

st.h Ak,<effa> 001011010 

st.w Ak,<effa> 001011100 



| Displacement | 
(31,15) 



Store addr. reg. byte 
Store addr. reg. halfword 
Store addr . reg . word 



Description: 

The contents of the source address register Ak are stored into the 
memory location referenced by the effective address. 



Notes ) 



Higher order bits of the Ak register used as the source are ignored 
for byte and halfword stores. 
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LOAD ADDRESS/IMMEDIATE 



Id. (h|w) #(n|N),Ak 



Purpose: 



Format: 



To load an address register with an immediate operand. 

Short Immediate 
| Opcode | n | Ak | 
15 6,5 3,2 
Long Immediate 

I Opcode |L| OOO | Ak | | N | 

15, 7,6 5 3,2 (31|15) 



Operation : 

Short: Ak = #n 
Long: Ak = #N 

PSW: 

Exceptions : 

Opcode : 

ld.h #N,Ak 
ld.w #N,Ak 

ld.h #n,Ak 
ld.w #n.Ak 



!#n is 0,1 7. 

!#N is an Immediate. 



000100010 Load half word imm. into Ak 

000100011 Load imm. into Ak 

0100010010 Load short imm. into Ak 

0100010011 Load short imm. into Ak 



Description: 

The specified immediate field is loaded into the specified address 
register . 



Section 9.7 



9-6 



Address Register Instruction Set 

ADD ADDRESS/ADDRESS add.(h|w) Aj,Ak 



Purpose: 

To add the contents of one address register to another. 

Eormat: 

| Opcode | Aj | Ak | 

15 6,5 3,2 



Operation: 

Ak = Ak + Aj 

PSW: 

C = Carry out of the most significant bit 
AIV = Integer Overflow 

Exceptions : 

Integer Overflow 

Opcode : 

add.h Aj,Ak 0101100000 Add addr. reg. half word 
add.w Aj,Ak 0101100001 Add addr. reg. word 

Description : 

The contents of the address register Ak are added to the address 
register Aj and the result loaded into the Ak. 

Notes : 
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MULTIPLY ADDRESS/ADDRESS mul . (h | w) Aj , Ak 



Purpose : 

To multiply the contents of one address register with another. 

Format : 

| Opcode | Aj | Ak | 

15 6,5 3,2 

Operation: 

Ak = Ak * A j 

PSW: 

C = Unchanged 

AIV = Integer overflow 

Exceptions: 

Integer Overflow 

Opcode : 

mul.h Aj,Ak 0101110000 Multiply addr. reg. halfword 
mul.w Aj,AK 0101110001 Multiply addr. reg. word 

Description: 

The contents of the address register Aj are multiplied by the 
address register Ak and the results loaded into Ak. 

Notes : 

The precision of the result is equal to the precision of the Ak. 
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DIVIDE ADDRESS/ADDRESS div.(h|w) Aj,Ak 



Purpose : 

To divide the contents of one address register by another. 

Format : 

| Opcode 1 Aj | Ak | 

15 6,5 3,2 

Operation : 

Ak = Ak / Aj 

PSW: 

AIV = Integer overflow 

ADZ = Address Register divide by zero 

Exceptions : 

Integer Overflow 
Divide by Zero 

Opcode : 

div.h Aj,Ak , 0101111000 Divide addr. reg. halfword 
div.w Aj,Ak 0101111001 Divide addr. reg. word 

Description : 

The contents of the address register Ak are divided by the address 
register Aj and the result loaded into the address register Ak. 

Notes : 

1 Integer overflow occurs if the largest negative number 
is divided by -1. 

2 If a divide by is detected, the result is the original 
dividend . 
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NEGATE ADDRESS/ADDRESS neg. (h|w) Aj,Ak 



Purpose : 

To negate the contents of an address register. 

Format: 

| Opcode | Aj | Ak | 

15 6,5 3,2 

Operation: 

Ak = - Aj 

PSW: 

AIV = Integer overflow 
C = Carry Out 

Exceptions : 

Integer Overflow 

Opcode : 

neg.h Aj,Ak 0101011010 Negate addr . reg. halfword 
neg.w Aj,Ak 0101011011 Negate addr. reg. word 

Description : 

The two's complement of Aj are loaded into Ak. 



Notes : 



1 The negate operation is identical to subtracting the 
contents of a register from 0. 

2 Overflow can occur only for the negation of the most 
negative number. 
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ADD ADDRESS/IMMEDIATE 



add. (h|w) #(n|N) ,Ak 



Purpose : 



Format: 



To Add an immediate field to an address register. 

Short Immediate 
| Opcode | n | Ak | 
15 6.5 3,2 
Long Immediate 

| Opcode |L| 000 | Ak | | N 
15, 7,6 5 3,2 (31|15) ( 



Operation: 

Short: Ak = Ak + #n 
Long: Ak = Ak + #N 



!#n is 0,1 7. 

!#N is an Immediate. 



PSW: 



C = Carry Out 

AIV = Integer Overflow 



Exceptions : 

Integer Overflow 



Opcode : 



add.h #n,Ak 
add.w #n,Ak 
add.h #N,Ak 
add.w #N,Ak 



0101100010 Add short imm. address halfword 

0101100011 Add short imm, address word 

000101000 Add 1mm. address halfword 

000101001 Add imm. address word 



Description : 

The immediate field is added to the destination field with the 
result loaded into the destination field. 
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SUBTRACT ADDRESS/IMMEDIATE 



sub. (h|w) #(n|N) ,Ak 



Purpose: 



Format! 



To Subtract an immediate field from an address register. 

Short Immediate 
| Opcode | n | Ak | 
15 6,5 3,2 
Long Immediate 
| Opcode |L| 000 | Ak | | 



N 



15, 



7,6 5 3,2 



Operation : 

Short: Ak = Ak - #n 
Long : Ak = Ak - #N 

PSW: 

C = Carry Out 

AIV = Integer Overflow 

Exceptions : 

Integer Overflow 



(31 | 15) 



!#n is 0,1 7. 

!#N is an Immediate. 



Opcode : 



sub.h #N,Ak 
sub.w #N,Ak 
sub.h #n,Ak 
sub.w #n,AK 



000101010 Subtract imm. address halfword 

000101011 Subtract imm. address word 

0101101010 Subtract short imm. address halfword 

0101101011 Subtract short imm. address word 



Description: 

The immediate field is subtracted from the address register Ak and 
the result loaded into Ak. 

Notes : 
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MULTIPLY ADDRESS/IMMEDIATE 



mul. (h|w) #(n|N) ,Ak 



Purpose : 



Format: 



To multiply an immediate field with an address register . 

Short Immediate 

| Opcode | n | Ak | 

_ _, _ 

IS 6,5 3,2 
Long Immediate 
| Opcode |L| 000 | Ak | | 



N 



15, 



7,6 5 



3,2 



Operation: 

Short: Ak = Ak * #n 
Long : Ak = Ak * #N 



(31 | 15) 



!#n is 0,1 7. 

!#N is an Immediate. 



I 




PSW: 



AIV = Integer Overflow 



Exceptions : 

Integer Overflow 



Opcode : 



mul . h #n , Ak 
mul.w #n,Ak 
mul.h #N,Ak 
mul.w #N,Ak 



0101110010 Multiply short imm. address halfword 

0101110011 Multiply short imm. address word 

000101100 Multiply imm. address halfword 

000101101 Multiply imm. address word 



Description : 

The address register Ak is multiplied by the immediate and the 
result loaded into Ak. 

Notes : 
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DIVIDE ADDRESS/IMMEDIATE 



div. (h|w) #(n|N) ,Ak 



Purpose : 



To divide an address register by an immediate. 



Format: 



I Ak | 
3,2 



Short Immediate 
| Opcode | n 
15 6,5 
Long Immediate 

| Opcode |L| 000 | Ak | | N 
15, 7,6 5 3,2 (31 | 15) 



I 




Operation : 

Short: Ak = Ak / #n 
Long : Ak = Ak / #N 



!#n is 0,1 7. 

! #N is an Immediate , 



PSW: 



AIV = integer overflow 
ADZ = integer divide by 



Exceptions : 

Integer Overflow 
Divide by Zero 



Opcode : 



div.h #n,Ak 
div.w #n,Ak 
div.h #N,Ak 
div.w #N,Ak 



0101111010 
0101111011 
000101110 
000101111 



Divide short imm. address half word 
Divide short imm. address word 
Divide imm. address half word 
Divide imm. address word 



Description: 

The address register Ak is divided by the immediate and the result 
loaded into the address register Ak. 



Notes : 



1 Integer overflow occurs if the largest negative number 
is divided by -1. 

2 For divide by 0, the result is the original dividend. 
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ADD SCALAR/ADDRESS add.w Sj,Ak 



Purpose: 

To add a scalar register to an address register. 

Format: 

| Opcode | Sj | Ak | 

15 6,5 3,2 

Operation: 

Ak = Ak + Sj<31. ,0> 

PSW: 

C = Carry out of the most significant bit 
AIV = Integer Overflow 

Exceptions : 

Integer Overflow 

Opcode : 

add.w Sj,Ak 0101000000 Add scalar to addr word 

Description : 

The contents of the Sj register are added to the contents of the Ak 
register . 

Notes : 
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AND ADDRESS/ADDRESS and Aj,Ak 



Purpose : 

To AND the contents of two address registers. 

Format: 

| Opcode | Aj | Ak | 

15 6,5 3,2 



Operation: 

Ak = Ak .AND. Aj 

PSW: 

Exceptions : 

Opcode : 

and Aj,Ak 0101001000 AND addr. reg. 

Description: 

The contents of the address register Ak are ANDed with the address 
register Aj and the result loaded into the address register Ak. 
All 32 bits of the A registers participate in the operation. 

Notes: 
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OR ADDRESS/ADDRESS 



or Aj , Ak 



Purpose : 

To or the contents of two address registers. 

Format : 

I Opcode | Aj | Ak | 
15 6,5 3,2 



Operation : 

Ak = Ak .OR. Aj 

PSW: 

Exceptions : 

Opcode : 

or Aj,Ak 0101001001 OR addr. reg. 

Description: 

The contents of the address register Ak are ored with the address 
register Aj and the result loaded into the address register Ak. 
All 32 bits of the A registers participate in the operation. 

Notes : 
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EXCLUSIVE OR ADDRESS/ADDRESS xor Aj,Ak 



Purpose : 

To exclusive OR the contents of two address registers. 

Format : 

| Opcode | Aj | Ak | 

15 6,5 3,2 

Operation: 

Ak = Ak .XOR. Aj 

PSW: 

Exceptions : 

Opcode : 

xor Aj,Ak 0101001010 Exclusive OR addr. reg. 

Description : 

t The contents of the address register Ak are exclusive ORed with the 
address register Aj and the result loaded into the address register 
Ak. All 32 bits of the A registers participate in the operation. 
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COMPLEMENT ADDRESS/ADDRESS not A j /Ak 



Purpose : 

To COMPLEMENT an address register. 

Format: 

| Opcode | Aj | Ak | 

15 6,5 3,2 

Operation : 

Ak = .NOT. Aj 

PSW: 

Exceptions : 

Opcode : 



not Aj,Ak 0101001011 Complement addr. reg. 

Description : 

The contents of the Aj address register are complemented; this 
value is then loaded into the Ak address register. 

Notes : 
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AND ADDRESS/IMMEDIATE and #N,Ak 



Purpose: 

To AND an immediate field to an address register. 

Format: 

| Opcode |L| 000 | Ak | | N 

15, 6,5 3,2 31 | 16 



Operation : 

Ak = Ak .AND. Immediate 

PSW: 

Exceptions : 

Opcode : 

and #N,Ak 000100100 AND imm. to addr. reg. 

Description : 

The address register Ak is logically ANDed with the immediate 
operand, and the result is loaded into Ak. 

Notes : 

1 16-bit immediates are sign-extended to 32 bits. 
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OR ADDRESS/IMMEDIATE or #N/Ak 



Purpose : 

To or an immediate field with an address register. 

Format: 

| Opcode |L| 000 | Ak | | N . | 
15, 6,5 3,2 31116 C 



Operation : 

Ak = Ak .OR. immediate 

PSW: 

Exceptions : 

Opcode: 

or #N,Ak 000100101 OR imm. to addr. regv 

Description: 

The address register Ak is logically ORed with the immediate 
operand and the result loaded into Ak. 

Notes : 

1 16-bit immediates are sign-extended to 32 bits. 
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EXCLUSIVE OR ADDRESS/IMMEDIATE xor #N,Ak 



Purpose: 

To exclusive OR an immediate field with an address register. 

Format: 

| Opcode |L| 000 | Ak | | N | 

15, 6,5 3,2 31|16 



Operation: 

Ak = Ak .XOR. Immediate 

PSW: 

Exceptions : 

Opcode : 

xor #N,AK 000100110 Exclusive OR imm. to addr. reg. 

Description : 

The address register Ak is logically exclusive ORed with the 
immediate operand, and the result is loaded into Ak. 

Notes : 

1 16-bit immediates are sign extended to 32 bits. 
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LOGICAL SHIFT ADDRESS/ADDRESS shf A j,Ak 



Purpose: 

To logically shift the contents of an address register. 

Format: 

| Opcode | Aj | Ak | 

15 6,5 3,2 

Operation: 

Ak = shift Ak by Aj<7. .0> 

PSW: 

Exceptions : 

Opcode : 

shf Aj,Ak 0101000001 Shift an address 

Description: 

The contents of the address register Ak are logically shifted 
according to a count contained within Aj<7..0>. If the count is 
positive, then Ak is shifted left. If the count is negative, then 
Ak is shifted right. Zeros fill vacated positions. All 32 bits of 
Ak participate in the shift. 



Bits<7..0> of Aj are examined to determine the shift 



count . 



Notes : 



A compare immediate instruction should be used to determine if the 
shift count in Aj is greater than 127 or less than -128. 
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LOGICAL SHIFT ADDRESS/IMMEDIATE 



shf #(n|N) ,Ak 



Purpose : 



Format: 



To logically shift with an immediate field to an address register. 



Short Immediate 



| Opcode | 
15 6,5 
Immediate 



I Ak | 
3,2 



| Opcode |L| 000 | Ak | | 

15, 8,7 6 3,2 32116 



Operation : 

Short: Ak = Shift Ak by In ! #n is 0,1 7. 

Long : Ak = Shift Ak by #N ! #N is an Immediate. 

PSW: 

Exceptions : 



Opcode : 



shf #n,Ak 0100010001 Logical shift left short imm. 
shf #N,AK 000100111 Logical shift imm. to addr. reg. 



Description : 

The address register Ak is logically shifted. The immediate field 
determines the direction and number of bits shifted. A positive 
immediate indicates a shift left. A negative immediate indicates a 
shift right. For shf by #n only a left shift is supported. 



Notes : 



shf by #n provides a convenient way to adjust an ordinal index to a 
byte offset. An example of this is A(I) , where A is an array con- 
taining 64-bit integers, and I is contained in an address register. 
I must be shifted left by 3 (multiply by 8) to convert I into into 
a byte offset relative to the start of A. 
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LOAD EFFECTIVE ADDRESS 



Idea <effa>,Ak 



Purpose: 



To load an address register with a byte pointer. 



Format: 



I Opcode |@|L| Aj | Ak | 

15 8,7,6,5 3,2 O 

Operation : 

Ak = Effective Address 

PSW: 

Exceptions: 

Opcode : 

Idea <effa>,Ak 000010010 



| Displacement | 
(31,15) 



Load effective address 



Description: 

The effective address, determined by evaluating the L,@,Aj fields. 



Notes : 



is loaded into Ak. 



No ring violation occurs if the developed effective address refer- 
ences an inner ring. 
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PUSH EFFECTIVE ADDRESS pshea <effa> 



Purpose: 

To push a byte pointer onto the stack. 

Format: 

| Opcode |@|L| Aj| k | | Displacement | 

15 8,7 6,5,3,2 (31|15) 



Operation: 

temp = Effective Address 
AO = AO -4 
c (AO) = temp 

PSW: 

Exceptions : 

Opcode : 

pshea <effa> 000011010 Push effective address 

Description: 

The effective address determined by evaluating the L,@,Aj fields is 
pushed onto the stack. 



Notes : 



1 No ring violation occurs if the developed effective 
address references an inner ring. 

2 The K field of the instruction is not used. 
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MOVE PC/ADDRESS mov PCAk 



Purpose: 

To move the address of the next instruction into an address regis- 



ter . 




Format: 




| Opcode | Ak | 


15 3,2 





Operation: 




Ak = Current-Address + 2 




PSW: 




Exceptions : 




Opcode : 





mov PC,Ak 0111110001010 Load next PC address 

Description: 

The address of the instruction following this instruction is loaded 
into Ak. 

Notes : 
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MOVE ADDRESS/ADDRESS mov Aj,Ak 



Purpose: 

To move the contents of one address register to another. 

Format: 

| Opcode | Aj | Ak | 

15 6,5 3,2 



Operation: 

Ak = Aj 

PSW: 

Exceptions : 

Opcode : 

mov Aj,Ak 0101000010 Move addr. reg. 

Description: 

i The contents of Aj are moved to Ak. Aj remains unchanged. 

Notes : 
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PUSH ADDRESS REGISTER psh.w Ak 



Purpose: 

To push an address register onto the stack. 

Format : 

| Opcode | Ak | 

15 3,2 



Operation : 

AO = AO - 4 
c(AO) = Ak 

PSW: 

Exceptions : 

Opcode : 

psh.w Ak 0111110100000 Push an addr. reg. 

Description : 

The contents of the Ak address register are pushed onto the stack 



Notes : 



1 When the register to be pushed is the stack pointer | 

itself, AO, the value pushed is the value after AO is | 

decremented by 4; in other words, the value after the | 
instruction is executed. 
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POP ADDRESS REGISTER pop.w Ak 



Purpose : 

To pop a word from the stack into an address register . 

Format: 

| Opcode | Ak | 

15 3,2 



Operation : 

Ak = c(AO) 
AO = AO + 4 

Exceptions : 

Opcode : 

pop.w Ak 0111110100010 Pop word into addr. reg. 

Description : 

The contents of Ak are loaded from a word at the top of the stack. 
The stack pointer is then incremented by 4 to reference the dew top 
of stack. 

Notes : 
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TEST AND SET tas <effa> 



Purpose : 

To indivisibly set a byte in memory. 

Format : 

| Opcode |<§|L| A j | k | | Displacement | 

15 8,7 6,5,3,2 (31|15) 



Operation : 

IF (c (Effective Address) .EQ. 0000 0000) THEN 

C = 1 
ELSE 

C = 
c(effa) = 1111 1111 
The Read and Write of memory is non-divisible. 

Exceptions : 

Opcode : 

tas <effa> 000011000 Test and Set a memory byte 

Description : 

The referenced byte is tested for all 0. If the byte is all 0, 
then carry is set to 1. Otherwise carry is set to 0. 



Notes : 



1 The test and set byte is used to test a byte in memory 
indivisibly. No I/O operation is permitted between the 
read and write of the referenced byte. 

2 The K field is unused. 
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LOAD PHYSICAL ldpa Aj,Ak 



Purpose: 

To convert a logical address to a physical address and load it into 
an address register. 

Format : 



| Opcode | Aj | Ak 
15 6,5 3,2 



Operation: 

The logical address in Aj is converted into a physical address. The 
address is checked for validity in the following sequence: 

ATU enabled; 

Ring maximization; 

Valid SDR; 

Valid Level 1 PTE; 

Valid Level 2 PTE; 

Resident Level 2 Page Table; 

Resident Data Page. 

If any of the above checks fail, the carry (C) is set to 1, and A5 
receives an error code. If the translation succeeds, the carry (C) 
is set to 0, and A5 receives the physical address. In most cases, 
Aj receives the address of the last Page Table Entry (PTE) , and Ak 
receives the PTE itself. However, Aj and Ak will be set to if: 

Logical equals physical. 
Ring maximization fails, or 
The SDR is invalid. 

PSW: 





C = 


1 


if 


invalid reference 




C = 





if 


valid 


reference. 


Exceptions: 










Opcode : 














ldpa Aj,Ak 


0100010000 



Load a physical byte address into Ak 

Description : 

The contents of Aj are assumed to be a logical address, and this 
address is translated to its equivalent physical address. If the 
logical address is valid, the physical address is placed in A5 and 
the carry bit (C) in the PSW is set to 0. If the logical address 
in invalid, error information is returned in A5, and C is set to 1. 
In either case, the physical address of the last Page Table Entry 
(PTE) is placed in A j , and the PTE itself is loaded in Ak. 
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The error information placed in A5 is consistent with System Excep- 
tion Conditions, as described in Chapter 6. The following table 
shows the potential errors which can occur with their corresponding 
codes: 

Error Class Qualifier 

ATU not enabled none 

Ring Violation 8 1 Inward Address Reference 

PTE Violation c 4 Invalid SDR 

5 Invalid Level 1 PTE 

6 Invalid Level 2 PTE 

Non-resident Page ' 10 Non-resident Level 2 Page Tabl< 

1 Non-resident Data Page 



Notes : 



1 After the instruction has completed, the carry, C, sig- 
nifies the validity of the translation. If C=l, the 
address was invalid, and AS contains an error code as a 
result. If C=0, the address was valid and A5 contains 
the physical address as a result. 

2 For Logical=Physical , Inward Address Reference, and 
Invalid SDR errors, the contents of Aj and Ak are unde- 
fined . 

3 The PTE returned in Ak is useful for checking for Trojan 
Horse Pointers (addresses provided by the user to the 
system for system call processing -- see Chapter 4 of 
the handbook for details) . Also contained in the PTE 
are the access privileges of the referenced address. 

4 This instruction can also be used to determine whether 
or not a page is resident in main memory. 

5 No access checks are performed (i.e. Read, Write, or 
Execute) . 
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MOVE PSW/ADDRESS mov PSW,Ak 



Purpose : 

To move the PSW into an address register . 

Format: 

| Opcode | Ak | 

15 3,2 

Operation : 

Ak = PSW 

PSW: 

Exceptions : 

Opcode : 

mov PSW,Ak 011111OOO10O0 Store the PSW into an addr. reg. 

Description : 

The PSW is loaded into the specified address register Ak. The PSW 
is unchanged after the loading of Ak. 

Notes : 

Before the PSW is moved to Ak, all existing concurrent processing 
is completed. This ensures that all exception condition flags 
accurately reflect the state of the processor. 
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MOVE ADDRESS/PSW mov Akpsw 



Purpose : 

To move an address register into the PSW 



Format: 

I Opcode | Ak | 

15 3,2 



Operation: 

PSW = Ak 

PSW: 

Exceptions : 

Opcode : 

mov Ak,psw 0111110001001 Load an addr. reg. into the PSW 

Description: 

The contents of the specified Ak are loaded into the PSW. 

Notes : 

Before Ak is moved to the PSW, all existing concurrent processing 
is completed. This ensures that all exception flags and trap 
enables accurately reflect the sequential state of the processor. 
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COMPARE ADDRESS/ADDRESS 



(le|lt|eq) . (h|w) Aj,Ak 



Purpose: 

To compare the contents of 2 address registers. 

Format: 

| Opcode | Aj | Ak | 

15 6,5 3,2 

Operation: 

IF (Aj .OPCODE-TEST. Ak) THEN 

C=l 
ELSE 

C=0 
ENDIF 



PSW: 



C is affected (see above) 



Exceptions : 
None 



Opcode : 



le.h Aj ,Ak 
le.w Aj ,AK 

lt.h Aj,Ak 
lt.w Aj,Ak 

eq .h Aj, Ak 
eq .w Aj, Ak 



OlOOllOOOO 
0100110001 

0100111000 
0100111001 

0100011000 
0100011001 



Compare less than or equal signed half word 
Compare less than or equal signed word 

Compare less than signed halfword 
Compare less than signed word 

Compare equal halfword 
Compare equal word 



Description : 

The contents of Ak are compared with A j . The result of the com- 
parison is used to modify the C bit of the PSW. If the comparison 
is true, the C bit is set to 1. If the comparison is false, the C 
bit is reset to 0. 
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Notes : 



1 The contents of the Ak and Aj registers are unmodified. 

2 The comparison A .NE. B, A .GT. B and A .GE. B is imple- 
mented by comparing A .EQ. B, A .LT. B and A .LE. B 
respectively, and testing for complemented C. 

3 Unsigned equal is equivalent to signed equal . 
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COMPARE ADDRESS/ADDRESS UNSIGNED (le | It) u. (h | w) Aj,Ak 



Purpose: 

To perforin unsigned comparisons between the contents of two address 
registers . 



Format: 






| Opcode | Aj | Ak | 


15 


6,5 3,2 





Operation: 






IF (Aj 


.OPCODE -TEST. Ak) 
C = 1 


THEN 


ELSE 


C = 




ENDIF 







PSW: 

C is affected (see above) . 

Exceptions : 
None 

Description: 

The contents of Ak are compared with A j . The result of the unsigned 
comparison is used to modify the C bit of the PSW. If the com- 
parison is true, the C bit is set to 1. If the comparison is false, 
the C bit is reset to 0. 



Opcode : 



Notes ! 



leu.h Aj,Ak 0100100000 Compare unsigned less than or equal halfwor 

ltu.h gt,Aj,Ak 0100101000 Compare unsigned less than halfword 

leu.w Aj,Ak 0100100001 Compare unsigned less or equal than word ' 

ltu.w gt,Aj,Ak 0100101001 Compare unsigned less than word 



1 Ak and Aj are not modified. 

2 Unsigned equality and inequality are performed by the 
signed comparisons for equality and inequality. 
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COMPARE ADDRESS/IMMEDIATE 



(le|lt|eq) . (h|w) #(n|N),Ak 



Purpose: 



Format: 



To compare the contents of an address register with an immediate. 

Short Immediate 

| Opcode | n | Ak | 
15 6,5 3,2 
Long Immediate 

| Opcode |L| 00O | Ak | | N | 

15, 7,6 S 3,2 (31|15) 



Operation : 

Short : 



PSW: 



IF ( Ak .OPCODE-TEST. #n) THEN !#n is (0,1 7). 

C = Lj_ 

ELSE 

C = 

ENDIF 

Long: 

IF ( Ak .OPCODE -TEST. #n) THEN !#N is an Immediate. 

C = 1 
ELSE 

C = 
ENDIF 



C is affected (see above) 



Exceptions : 
None 

Opcode : 

le.h #n,Ak 
le.w #n,Ak 
lt.h #n,Ak 
lt.w #n,Ak 

eq.h #n,Ak 
eq.w #n,Ak 

le.h #N,Ak 

Section 9.7 



0100110010 Compare less than or equal halfword 

0100110011 Compare less than or equal word 

0100111010 Compare less than halfword 

0100111011 Compare less than word 

0100011010 Compare equal halfword 

0100011011 Compare equal word 

000111100 Compare less than or equal halfword 
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le.w #N,Ak 000111101 Compare less than or equal word 
lt.h #N,Ak 000111110 Compare ".ess than halfword 
lt.w #N,Ak 000111111 Compare less than word 

eq.h #N,Ak 000110110 Compare equal halfword 
eq.w #N,Ak 000110111 Compare equal word 

Description : 

The contents of the Ak register are compared to the specified 
immediate. If the comparison is true, the C bit of the PSW is set 
to 1. If the comparison is false, the C bit of the PSW is reset to 
0. 



Notes : 



1 The contents of the Ak register are not modified. 
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COMPARE ADDRESS/IMMEDIATE UNSIGNED 



(le|lt)u. (h|w) #(n|N) ,Ak 



Purpose: 



To compare the unsigned contents of an address register with an 
immediate . 



Format: 



Short Immediate 

| Opcode | n | Ak | 
15 6,5 3,2 
Long Immediate 

| Opcode |L| 000 | Ak | | N | 

15, 7,6 5 3,2 (31|15) 

Operation: 

IF ( Ak .OPCODE-TEST. #n) THEN !#n is (0,1 7). 

C = 1 
ELSE 

C = 
ENDIF 

IF ( Ak .OPCODE -TEST. #n) THEN !#N is an Immediate. 

C = 1 
ELSE 

C = 
ENDIF 



PSW: 



C is affected (see above) 



Exceptions : 
None 



Opcode : 



leu.h #n,Ak 
ltu.h #n,Ak 
leu.h #N,Ak 
ltu.h #N,Ak 

leu.w #n,Ak 
ltu.w #n,Ak 
leu.w #N,Ak 
ltu.w #N,Ak 



0100100010 
0100101010 
000111000 
000111010 

0100100011 
0100101011 
000111001 
000111011 



Compare unsigned less than or equal ha 
Compare unsigned less than halfword 
Compare unsigned less than halfword 
Compare unsigned less than halfword 

Compare unsigned less than or equal woi 
Compare unsigned less than word 
Compare unsigned less than word 
Compare unsigned less than word 



Section 9.7 



9-42 



Address Register Instruction Set 

Description : 

The contents of the Ak register are compared to the specified 
immediate. If the comparison is true, the C bit of the PSW is set 
to 1. If the comparison is false, the C bit of the PSW is reset to 
0. 

Notes : 

1 The contents of the Ak register are not modified. 

2 Compare equal immediate N is performed using the eq.x 
#N , Ak instructions . 
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CONVERT INTEGER ADDRESS/ADDRESS cvt Aj,Ak 



Purpose : 

To convert the integer contents of one A register to an integer of 
different precision. 

Format: 



| Opcode | Aj | Ak 
15 6,5 3,2 



Operation: 

Ak = Convert (Aj) ! according to the opcode 

PSW: 

AIV = Integer Overflow 

Exceptions : 

Integer Overflow 

Opcode : 

cvtw.b Aj,Ak 0100000000 Convert word to byte 

cvtw.h Aj.Ak 0100000001 Convert word to halfword 

cvtb.w Aj.Ak 0100000010 Convert byte to word 

cvth.w Aj,AK 0100000011 Convert half to word 

Description : 

The specified integer is converted to specified destination integer 
and the result is loaded into Ak. An overflow exception can occur 
for the word to byte and word to halfword instructions. Bytes and 
halfwords are sign extended to words. 

Notes : 

1 The other possible signed conversions are implemented by 
executing two of the provided conversions. Halfword to 
byte is performed by cvth.w and cvtw.b. Byte to half- 
word is performed by cvtb.w and cvtw.h. 

2 Unsigned conversions may be implemented using logical 
AND instructions. 
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CHAPTER 10 
10 Scalar Register Instruction Set 



This chapter describes the instructions which primarily manipulate the 
scalar register set. The instruction categories include Scalar Loads and 
Stores, Scalar Load Immediates, Scalar /Scalar Arithmetics, Scalar/Scalar 
Logical Operations, Arithmetic Immediates, Logical Immediates, Push/Pop 
Scalar Registers, Scalar/Scalar Compares Signed/Unsigned, S Register Com- 
pare Immediates, S Register Conversions, and Shifts/Moves/Counts. There are 
8 x 64 bit scalar registers. The scalar registers are used to perform com- 
putation on fixed and floating point operands. Instructions which affect 
both scalar and vector registers are described in the chapter on the Vec- 
tor/ Scalar Instruction Set. Details are also provided on the scheduling 
of these instructions. 



lo.i Scalar Loads and Stores 

These instructions include Load Scalar Register and Store Scalar Register; 
both instruction types work with byte, halfword, word and longword operands 
and do not affect flags in the Processor Status Word. The subgroup. Scalar 
Load Immediates, includes Load Scalar/Immediate as the only instruction, 
and its data types include word and longword. Load Scalar/Immediate does 
not affect the flags in the Processor Status. 



10 . 2 Scalar / Scalar 

Instructions included in this group include Add Scalar/Scalar, Subtract 
Scalar/Scalar, Multiply Scalar/Scalar, Divide Scalar /Scalar, and Negate 
Seal ar /Scalar . All Scalar/Scalar Arithmetics are either byte, halfword, 
word, or longword integers, or single or double precision floating point. 
Integer Overflow is signified by the SIV bit in the PSW, Exponent Overflow 
by the OV bit. Exponent Underflow by the UN bit, and Carry output by the SC 
bit. RO is used to signify a reserved operand. The SDZ bit shows a Divide 
by Zero. An integer FDZ indicates a floating point divide by zero. 



10.3 Scalar/Scalar Logical Operations 

These instructions include AND Scalar/Scalar, OR Scalar /Scalar, Exclusive 

OR Scalar/Scalar, and Complement Scalar /Scalar. No flags are affected in 
the PSW. 
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10.4 s Register Immediaiea 

These instructions provide a means to perform an operation between an S 
register and an immediate. The section has two sub groups: Arithmetic 
Immediates and Logical Immediates. Arithmetic Immediates include Add, Sub- 
tract, Multiply, and Divide Scalar/Immediate instructions which work with 
halfword, word, or Single Precision Floating Point operands and affect the 
following flags in the PSW: 

SIV = Integer Overflow; Integer only 

OV = Exponent Overflow; Floating Point only 

UN = Exponent Underflow; Floating Point only 

SC = Carry Output; Integer only 

RO = Reserved Operand; Floating Point only 
SDZ = Integer Divide by Zero 
FDZ = Floating Point Divide by Zero 

Logical Immediates include AND, OR, Exclusive OR, and Logical Shift 
Scalar/Immediate instructions, which affect no flags in the PSW. 

10.5 Eush/Eflp Scalar Registers 

These instructions- -Push Scalar Register and Pop Scalar Register- -operate 
on word and longword and do not affect the flags in the PSW. 

10.6 Seal ar/Sca lac C.o.mj2axss Signed/ Unsigne d 

This section includes Compare Scalar/Scalar and Compare Scalar/Scalar 
Unsigned, and the following S Register Compare Immediates: Compare 
Scalar/Immediate and Compare Scalar/Immediate Unsigned. Compare 
Scalar /Scalar works with byte, halfword, word, longword, single and double 
precision floating point and affects only the SC bit in the PSW. The Com- 
pare Scalar /Scalar Unsigned instruction can be byte, word, halfword, or 
longword and likewise affects only the SC bit in the PSW. S Register Com- 
pare Immediate instructions provide immediate to S register operations. 
The immediates provided are either 16 or 32 bit and affects only the SC 
bit. 



10.7 S Re gist er Conversions 

This instruction provides conversions from one S register to another. 
Integer Overflow is signified by the SIV bit, and Floating Point Overflow 
by the OV bit in the PSW. 
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10 . 8 ShJLf ts/Mflyss/Cfluiits 

These instructions include Logical Shift Scalar/Scalar, Trailing Zero 
Count, Population Count Scalar, Move Scalar to Address, Move 
Address/Scalar, and Move Scalar /Scalar . None of these instructions affects 
the flags in the PSW. 
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LOAD SCALAR REGISTER 



Id. (b|h|wjl|s|d) <effa>,Sk 



Purpose : 



To load an operand into a scalar accumulator. 



Formats 



I Opcode |@|L|Aj |Sk | 
15 8,7 6,5,3,2 



| Displacement | 
(31 | 15) 



Operation: 

Sk = c (Effective Address) 

PSW: 

Exceptions : 

Opcode : 

ld.b <effa>,Sk 001100000 

ld.h <effa>,Sk 001100010 

ld.w <effa>,Sk 001100100 

ld.1 <effa>,Sk 001100110 

ld.s <effa>,Sk 001100100 

ld.d <effa>,Sk 001100110 



Load scalar byte 
Load scalar halfword 
Load scalar word 
Load scalar longword 
Load scalar single float 
Load scalar double float 



Description : 

The referenced data is loaded into the specified scalar register. 



Notes : 



1 Single precision floating point and 32 bit integer 
occupy the same bit positions (<31..0>) within a scalar 
register . 

2 Byte data occupies bit positions <7. .0> within a scalar 
register . 

3 Halfword data occupies bit positions <15..0> within a 
scalar register. 

4 The .s and .w forms of this instruction are equivalent, 
as are the . d and . 1 forms . The . s and . d forms are 
added for convenience . 
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STORE SCALAR REGISTER 



st. (b|h|w|l|s|d) Sk,<effa> 



Purpose : 



To store an operand from a scalar register. 



Eormat : 



1 Opcode |@|L|Aj |Sk | 
15 8,7 6,5,3,2 



| Displacement | 
(31 | 15) 



Operation : 

c (Effective Address) = Sk 

PSW: 

Exceptions : 

Opcode : 



st.b Sk,<effa> 
st.h Sk,<effa> 
st.w Sk,<effa> 
st.l Sk,<effa> 
st.s Sk,<effa> 
st.d Sk,<effa> 



001101000 
001101010 
001101100 
001101110 
001101100 
001101110 



Store scalar byte 
Store scalar half word. 
Store scalar word 
Store scalar longword 
Store scalar single float 
Store scalar double float 



Description : 

The referenced data is stored from the specified scalar register Sk 
into the referenced memory location. 



Notes: 



1 Single precision floating pointer and 3 2 -bit integer 
occupy the same bit positions within a scalar register. 

2 The . s and .w forms of this instruction are equivalent, 
as are the . d and . 1 forms . The . s and . d forms are 
added for convenience . 
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LOAD SCALAR/IMMEDIATE 



Id. (w|l|d|u|du|dl|lu|ll) #N,Sk 



Purpose : 



To load a word immediate into a scalar register. 



Format: 



| Opcode |L| 001 | Sk | 
15 7 6 5 3,2 



Operation : 

Sk<63..32> = Immediate 
Sk<31. .0> = 



N 



31116 



! ld.d,#N,Sk 



! ld.l,#N,Sk 



Sk<63..32> - Immediate<31> 
Sk<31 . . 0> = Immediate 

! Id. (w|dl|ll),#N,Sk 
Sk<63..32> = Sk<63..32> ! (unchanged) 
Sk<31 . . 0> = Immediate 

! ld.(u|du|lu) #N,Sk 

Sk<63 . . 32> =' Immediate 

Sk<31..0> = Sk<31..0> ! (unchanged) 



PSW: 

Exceptions : 
Opcode : 



ld.d #N,Sk 
ld.l #N,Sk 
ld.w #N,Sk 
ld.u #N,Sk 

ld.du #N,Sk 
ld.dl #N,Sk 
ld.lu #N,Sk 
Id. 11 #N,Sk 



000100000 Load immediate, most significant bits 

000100010 Load 64 bit sign extended immediate 

000100011 Load a 32 bit immediate 

000100001 Load immediate, upper half 

000100001 Load 64 bit floating immed. , upper hal] 

000100011 Load 64 bit floating immed., lower halJ 

000100001 Load 64 bit integer immed., upper half 

000100011 Load 64 bit integer immed., lower half 



Description : 

The immediate is loaded into Sk. 



Notes : 



1. A full 64-bit immediate can be developed by first 
performing a ld.d,#N,Sk and then by executing a 
ld.w,#N,Sk. 
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The forms using .du, . lu, .dl, and .11 allow con- 
venient use of 64 bit immediate operators, and load 
either the upper half (.du and . lu) or the lower 
half (.dl and .11), respectively. For example, 
"ld.du #3.14159, sO" causes the upper half of the 64 
bit floating point constant PI to be loaded into 
the upper half of register sO. These four forms 
are necessary since immediate operands are only 32 
bits in length. 
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ADD SCALAR/SCALAR add. (b |h| w| 1 |s |d) S j , Sk 



Purpose : 

To add a scalar to a scalar. 

Format : 

| Opcode | Sj | Sk | 

15 6,5 3,2 

Operation : 

Sk = Sk + Sj 

PSW: 

SIV = Integer Overflow! Integer Only 

OV = Exponent Overflow! Floating Point only 

UN = Exponent Underflow! Floating point only 

SC = Carry Output! Integer Only 

RO = Reserved Operand! Floating point only 

Exceptions : 

Integer Overflow 
Exponent Overflow 
Exponent Underflow 
Reserved Operand 

Opcode : 

add.b Sj,Sk 0101100100 Add scalar/scalar Integer byte 

add.h Sj,Sk 0101100101 Add scalar/scalar integer halfword 

add.w Sj,Sk 0101100110 Add scalar/scalar integer word 

add.l Sj,Sk 0101100111 Add scalar/scalar integer longword 

add.s Sj,Sk 0101010100 Add scalar/scalar single float 

add.d Sj,Sk 0101010101 Add scalar/scalar double float 

Description : 

The contents of the scalar register Sk are added to the con- 
tents of the scalar register S j , and the scalar result is 
loaded into Sk. 

Notes: 
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SUBTRACT SCALAR/SCALAR 



sub. (b|h|w|l |s|d) S j , Sk 



Purpose : 






To 


subtract a 


scalar from a 


F ormat : 






1 


Opcode 


| Sj | Sk | 


15 




6,5 3,2 


Operation: 






Sk 


= Sk - Sj 





PSW: 

SIV = Integer Overflow; Integer Only 

OV = Exponent Overflow; Floating Point Only 

UN = Exponent Underflow; Floating Point Only 

SC = Carry output 

RO = Reserved Operand Only; Floating Point Only 

Exceptions : 

Integer Overflow 
Exponent Overflow 
Exponent Underflow 
Reserved Operand 



Opcode : 



sub.b Sj,Sk 
sub.h Sj,Sk 
sub.w Sj,Sk 
sub.l Sj,Sk 
sub.s Sj,Sk 
sub.d Sj,Sk 



0101101100 
0101101101 
0101101110 
0101101111 
0101010110 
010101O111 



Subtract 
Subtract 
Subtract 
Subtract 
Subtract 
Subtract 



scalar/scalar 
scalar /scalar 
scalar /scalar 
scalar /scalar 
scalar/scalar 
scalar/scalar 



integer byte 
integer halfword 
integer word 
integer longword 
single float 
double float 



Description : 

The contents of the scalar register Sj are subtracted from the 
contents of the scalar register Sk, and the scalar result is 
loaded into Sk. 

Notes: 
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MULTIPLY SCALAR/SCALAR 



mul. (b|h|w|l|s|d) Sj,Sk 



Purpose: 



To multiply a scalar by a scalar. 



Format: 



Opcode 



| Sj | Sk | 



15 

Operation: 

Sk = 

PSW: 



6,5 3,2 



Sk 



Sj 



SIV = Integer Overflow; Integer Only 
OV = Exponent Overflow; Floating Point Only 
UN = Exponent Underflow; Floating Point Only 
RO = Reserved Operand; Floating Point Only 

Exceptions : 

Integer Overflow 
Exponent Overflow 
Exponent Underflow 
Reserved Operand 



Opcode : 



mul.b Sj,Sk 
mul.h Sj,Sk 
mul.w Sj,Sk 
mul .1 S j , Sk 
mul . s S j , Sk 
mul • d S j , Sk 



0101110100 Multiply scalar/scalar 

0101110101 Multiply scalar/scalar 

0101110110 Multiply scalar/scalar 

0101110111 Multiply scalar/scalar 

0101011100 Multiply scalar/scalar 

0101011101 Multiply scalar/scalar 



integer byte 
integer halfwor 
integer word 
integer longwor 
single float , 
double float 



Description : 

The contents of the scalar register Sk are multiplied with the 
contents of the scalar register S j , and the scalar result is 
loaded into Sk. 

Notes : 
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DIVIDE SCALAR/SCALAR 



div. (b|h|w|l |s|d) S j , Sk 



Purpose: 



To divide a scalar by a scalar. 



Format: 



| Opcode 


| Sj | Sk 


15 


6,5 3,2 


Operation: 




Sk = Sk / Sj 





PSW: 

SIV = Integer Overflow! Integer only 

OV = Exponent Overflow! Floating Point only 

UN = Exponent Underflow! Floating Point only 

SDZ = Divide by zero! Integer only 

RO = Reserved Operand! Floating Point only 

FDZ= Divide by zero! Floating Point only 

Exceptions : 

Integer Overflow 
Exponent Overflow 
Exponent Underflow 
Reserved Operand 



Opcode : 



div.b Sj,Sk 
div.h Sj,Sk 
div.w Sj,Sk 
div.l Sj,Sk 
div.s Sj,Sk 
div.d Sj,Sk 



0101111100 
0101111101 
0101111110 
0101111111 
0101011110 
0101011111 



Divide 
Divide 
Divide 
Divide 
Divide 
Divide 



scalar/scalar 
scalar/scalar 
scalar /scalar 
scalar/scalar 
scalar/scalar 
scalar/scalar 



integer byte 
integer half word 
integer word 
integer longword 
single float 
double float 



Description : 

The contents of a scalar Sk are divided by a 
the result is loaded into Sk. 



scalar Sj, and 



Notes : 
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NEGATE SCALAR/SCALAR 



neg. (b|h|w|l |s|d) S j , Sk 



Purpose : 



To negate a scalar. 



Format: 



Opcode 



Sj 



Sk 



15 

Operation: 

Sk = O 

PSW: 



6,5 3,2 



Sj 



SIV = Integer Overflow; Integer only 

SC = Integer Carry Out. 

OV = Exponent Overflow; Floating Point only 

UN = Exponent Underflow; Floating point only 

RO = Reserved Operand; Floating Point Only 

Exceptions : 

Integer Overflow 
Exponent Overflow 
Exponent Underflow 
Reserved Operand 



Opcode 



neg.b Sj,Sk 
neg.h Sj,Sk 
neg.w Sj,Sk 
neg.l Sj,Sk 
neg.s Sj,Sk 
neg.d Sj,Sk 

Description : 

The algebraic negation of Sj is loaded into Sk. The result 
identical to subtracting Sj from 0. 

Notes : 



0110111100 


Negate 


seal ar/scal ar 


integer byte 


0110111101 


Negate 


scalar/scalar 


integer halfword 


0110111110 


Negate 


scalar/scalar 


integer word 


0110111111 


Negate 


scalar /scalar 


integer longword 


0110010110 


Negate 


scalar /scalar 


single float 


0110010111 


Negate 


scalar/scalar 


double float 



is 
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AND SCALAR/SCALAR and S j , Sk 



Purpose : 

To AND the contents of two scalars. 



Format: 




| Opcode 


1 Sj | Sk | 


15 


6,5 3,2 


Operation: 




Sk = Sk .AND. 


Sj 


PSW: 




Exceptions : 




Opcode : 





and Sj,Sk 0101001100 AND scalar/scalar 

Description : 

The contents of the Sk register are ANDed with the contents of 
the Sj register. The results of the AND are loaded into Sk. 
All 64-bits of each scalar register participate in the opera- 
tion. 

Notes : 
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OR SCALAR/SCALAR or Sj,Sk 



Purpose: 

To OR the contents of two scalars. 



Format: 




| Opcode 


| Sj | Sk | 


15 


6,5 3,2 


Operation : 




Sk = Sk .OR. 


Sj 


PSW: 




Exceptions : 




Opcode : 




or Sj,Sk 


0101001101 



OR scalar/scalar 

Description : 

The contents of the Sj scalar register are ORed with the con- 
tents of the Sk register. The results of the OR are loaded 
into Sk. All 64-bits of the scalar registers participate in 
the operation. 

Notes : 
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EXCLUSIVE OR SCALAR/SCALAR xor S j , Sk 



Purpose: 

To Exclusive OR the contents of two scalar registers. 



Format: 




| Opcode 


| Sj | Sk | 


15 


6,5 3,2 


Operation : 




Sk = Sk .XOR. 


Sj 


PSW: 




Exceptions : 




Opcode : 




xor S j , Sk 


0101001110 



Exclusive OR scalar/scalar 

Description: 

The contents of the Sj vector register are exclusive ORed with 
the contents of the Sk register. The results of the exclusive 
or are loaded into Sk. All 64-bits of the scalar registers 
participate in the operation. 

Notes : 
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COMPLEMENT SCALAR/SCALAR not S j , Sk 



Purpose : 

To COMPLEMENT the contents of a scalar. 

Format: 

| Opcode | Sj | Sk | 
IS 6,5 3,2 O 

Operation : 

Sk = .NOT. Sj 

PSW: 

Exceptions : 

Opcode : 

not Sj.Sk 0101001111 Complement scalar/scalar 

Description : 

The contents of the Sj scalar register are complemented and 
the results loaded into, the Sk scalar register. All 64-bits 
of the scalar registers participate in the operation. 

Notes : 
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ADD SCALAR/IMMEDIATE 



add.(h|w|s) #N,Sk 



Purpose: 



To add an immediate to a scalar. 



Format: 



| Opcode |L| 001 | Sk | 
15 7 6 5 3,2 



Operation : 

Sk = Sk + Immediate 

PSW: 

SIV = Integer Overflow 
OV = Exponent Overflow 
UN = Exponent Underflow 
SC = Carry Output 
RO = Reserved Operand 

Exceptions : 

Integer Overflow 
Exponent Overflow 
Exponent Underflow 
Reserved Operand 



Opcode : 



I N 
31116 



Integer Only 
Floating Point only 
Floating point only 
Integer Only 
Floating point only 



add.h #N,Sk 
add.w #N,Sk 
add.s #N,Sk 



000101000 Add scalar /immed. integer halfword 

000101001 Add scalar /immed. integer word 
000110000 Add scalar/immed. single float 



Description : 

The contents of the scalar register Sk are added to the con- 
tents of immediate, and the scalar result is loaded into Sk. 
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SUBTRACT SCALAR/IMMEDIATE sub . (h | w | s) #N, Sk 



Purpose : 

To subtract an immediate from a scalar. 

Format: 



| Opcode |L| 001 | Sk | 


1 


N 


1 


15 7 6 5 3,2 

Operation : 

Sk = Sk - Immediate 


31|16 








PSW: 

SIV = Integer Overflow; Integer Only 

OV = Exponent Overflow; Floating Point Only 

UN = Exponent Underflow; Floating Point Only 

SC = Carry output 

RO = Reserved Operand Only; Floating Point Only 

Exceptions : 

Integer Overflow 
Exponent Overflow 
Exponent Underflow 
Reserved Operand 

Opcode : 

sub.h #N,Sk 000101010 Subtract scalar/immed. Integer half won 

sub.w #N,Sk 000101011 Subtract scalar/immed. integer word 

sub.s #N,Sk 000110001 Subtract scalar/immed. single float 

Description : 

The contents of the immediate are subtracted from the contents 
of the scalar register Sk, and the scalar result is loaded 
into Sk. 

Notes : 
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MULTIPLY SCALAR/IMMEDIATE mul . (h | w | s) #N , Sk 



Purpose : 

To multiply a scalar by a scalar. 

Format: 

| Opcode |L| 001 | Sk | | N | 

15 7 6 5 3,2 31116 



Operation : 

Sk = Sk * Sj 

PSW: 

SIV = Integer Overflow; Integer Only 
OV = Exponent Overflow; Floating Point Only 
UN = Exponent Underflow; Floating Point Only 
RO = Reserved Operand; Floating Point Only 

Exceptions : 

Integer Overflow 
Exponent Overflow 
Exponent Underflow 
Reserved Operand 

Opcode : 

mul.h #N,Sk 000101100 Multiply scalar/immed. integer halfword 

mul.w #N,Sk 000101101 .Multiply scalar/immed. integer word 

mul.s #N,Sk 000110010 Multiply scalar/immed. single float 

Description : 

The contents of the scalar register Sk are multiplied with the 
contents of the scalar register Sj, and the scalar result is 
loaded into Sk. 

Notes : 
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DIVIDE SCALAR/IMMEDIATE div . (h | w | s) #N, Sk 



Purpose: 

To divide a scalar by a scalar. 

Format : 



| Opcode |L| 001 | Sk | | N | 
15 7 6 5 3,2 31 | 16 

Operation : 

Sk = Sk / Immediate 

PSW: 

SIV = Integer Overflow; Integer only 

OV = Exponent Overflow; Floating Point only 

UN = Exponent Underflow; Floating Point only 

SDZ = Divide by zero; Integer only 

RO = Reserved Operand; Floating Point only 

FDZ = Divide by zero; Floating Point only 

Exceptions : 

Integer Overflow 
Exponent Overflow 
Exponent Underflow 
Reserved Operand 

Opcode: 

div.h #N,Sk 000101110 Divide scalar /scalar integer halfword 

div.w #N,Sk 000101111 Divide scalar/scalar integer word 

div.s #N,Sk 000110011 Divide scalar/scalar single float 

Description: 

The contents of a scalar Sk are divided by an immediate, and 
the result is loaded into Sk. 

Notes: 
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AND SCALAR/IMMEDIATE and #N,Sk 



Purpose : 

To AND the contents of a scalar and an immediate 

Format : 

| Opcode |L| 001 | Sk | | N | 

15 7 6 5 3,2 31 | 16 

Operation: 

Sk<31..0> = Sk<31..0> .AND. Immediate 

PSW: 

Exceptions : 

Opcode : 

and #N,Sk 000100100 AND scalar/immediate 

Description! 

The contents of the Sk vector register are ANDed with the 
immediate. The results of the AND are loaded into Sk. The 
least significant 32 bits of Sk participate in the operation. 

Notes : 

1 16-bit immediates are sign-extended to 32 bits. 
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OR SCALAR/IMMEDIATE or #N/Sk 



Purpose: 

To OR the contents of a scalar and an immediate 

Format : 

| Opcode |L| 001 | Sk | | N | 
15 7 6 5 3,2 31 | 16 

Operation : 

Sk<31..0> = Sk<31..0> .OR. Immediate 

PSW: 

Exceptions : 

Opcode : 

or #N,Sk 000100101 OR scalar/immediate 

Description : 

The immediate is ORed with contents of the Sk register. The 
results of the OR are loaded into Sk. The least significant 32 
bits of Sk participate in the operation. 

Notes : 

1 16-bit immediates are sign-extended to 32 bits. 
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EXCLUSIVE OR SCALAR/IMMEDIATE xor #N,Sk 



Purpose : 

To Exclusive OR the contents of an immediate with a scalar 

Format: 

| Opcode |L| 001 | Sk | | N | 

15 7 6 5 3,2 31 | 16 

Operation: 

Sk<31..0> = Sk<31..0> .XOR. Immediate 

PSW: 

Exceptions : 

Opcode : 

xor #N,Sk 000100110 Exclusive OR scalar /immediate 

Description: 

The contents of an immediate are exclusive ORed with the con- 
tents of the Sk register. The results of the exclusive or are 
loaded into Sk. The least significant 32 bits of Sk partici- 
pate in the operation. 

Notes : 

1 16-bit immediates are sign-extended to 32 bits. 
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LOGICAL SHIFT SCALAR/IMMEDIATE 



shf #N, Sk 



Purpose: 



To logically shift the contents of a scalar as 
an immediate 



controlled by- 



Format : 



| Opcode |L| 001 | Sk | | M 

15 7 6 5 3,2 31 | 16 

Operation: 

Sk<63..0> = Shift Sk by immediate<7. .0> 

PSW: 
Exceptions : 



Opcode : 



shf #N,Sk 



000100111 



Shift Scalar/immediate 



Description : 

The contents of the Sk scalar register are shifted according 
to the value specified in the immediate. Only bits<7..0> are 
used to control the shift. The shift count is interpreted as 
an 8-bit two's complement number. Thus, if bit<7> is a 0, the 
logical shift is left. If bit<7> is a 1, the logical shift is 
to the right. Vacated positions are zero filled. 
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PUSH SCALAR REGISTER 



psh. (w|l) Sk 



Purpose : 



To push an Sk register onto the stack . 



Format: 



Opcode 



I Sk | 
3,2 



15 



Operation: 

IF (PSH. W- OPCODE) THEN 

AO = AO - 4 
ELSE 

AO = AO - 8 
ENDIF 

c(AO) = Sk<63..0> 
c(AO) = Sk<31. .0> 

PSW: 

Exceptions : 

Opcode : 



! psh.l 

! if long 
! if word 



psh.w Sk 
psh.l Sk 



0111110100100 
0111110100101 



Push Sk<31..0> onto the stack 
Push Sk<63..0> onto the stack. 



Description : 

The scalar register, Sk, is pushed onto the stack. Either all 
64 bits of Sk or the least significant 32 bits of Sk are 
pushed . 

Notes : 
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POP SCALAR REGISTER pop.(w|l)Sk 



Purpose : 

To pop an Sk register from the stack. 

Format : 

| Opcode | Sk | 

15 3,2 

Operation : 

IF (POP. W- OPCODE) THEN 

Sk<31. .0> = c(AO) 

AO = AO - 4 
ELSE 

Sk = c(AO) 

AO = AO - 8 ! pop.l 
ENDIF 

PSW: 

Exceptions : 

Opcode : 

pop.w Sk 0111110100110 Pop Sk<31..0> from the stack 
pop.l Sk 0111110100111 Pop Sk<63..0> from the stack. 

Description : 

The scalar register, Sk, is popped from the stack. Either all 
64 bits or the least significant 32 bits of Sk are loaded from 
the stack. 

Notes : 
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COMPARE SCALAR/SCALAR 



(le|lt|eq) . (b |h|w| 1 |s |d) S j , Sk 



Purpose: 



To sign compare a scalar and scalar and load SC 



Format: 



Opcode 



15 
Operation : 



I Sj | Sk | 
6,5 3,2 



IF (Sj .OPCODE -TEST. Sk) THEN 

SC = 1 
ELSE 

SC = 
ENDIF 



PSW: 



SC is affected (see above) 



Exceptions : 

Reserved Operand (floating point only) 



Opcode : 



le.b Sj,Sk 0100110100 Compare less than or equal byte 

lt.b Sj,Sk 0100111100 Compare less than byte 

eq.b Sj,Sk OlOOOlllOO Compare equal byte 

le.h Sj,Sk 0100110101 Compare less than or equal halfword 

lt.h Sj,Sk 0100111101 Compare less than halfword 

eq.h Sj,Sk 0100011101 Compare equal halfword 

le.w Sj,Sk 0100110110 Compare less than or equal word 

lt.w Sj,Sk 0100111110 Compare less than word 

eq.w Sj,Sk 0100011110 Compare equal word 

le.l Sj,Sk 0100110111 Compare less than or equal longword 

lt.l Sj,Sk 0100111111 Compare less than longword 

eq.l Sj,Sk 0100011111 Compare equal longword 

le.s S j , Sk 0101010000 Compare less than or equal single float 

lt.s Sj,Sk 0101010010 Compare less than single float 

eq.s S j , Sk 0101011000 Compare equal single float 

le.d Sj,Sk 0101010001 Compare less than or equal double float 

lt.d Sj,Sk 0101010011 Compare less than double float 

eq.d Sj,Sk 0101011001 Compare equal double float 
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Description: 

The scalar registers Sk and Sj are signed compared. The result 
is loaded into SC. If the comparison is .TRUE., then SC is set 
to 1. If the comparison is .FALSE., then SC is reset to 0. 

Notes : 
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COMPARE SCALAR/SCALAR UNSIGNED 



(le|lt)u. (b|h|w|l) Sj,Sk 



Purpose: 



To unsigned compare a scalar and scalar and load SC 



Format: 



Opcode 



15 



| Sj | Sk | 
6,5 3,2 



Operation: 

IF (Sj .OPCODE -TEST. Sk) THEN 

C = 1 
ELSE 

C = O 
ENDIF 

PSW: 

SC is affected (see above) . 

Exceptions : 

Opcode : 



leu.b Sj,Sk 
ltu.b Sj,Sk 

leu.h Sj,Sk 
ltu.h Sj,Sk 

leu.w Sj,Sk 
ltu.w Sj,Sk 

leu.l Sj,Sk 
ltu.l Sj,Sk 



0100100100 
0100101100 

0100100101 
0100101101 

0100100110 
0100101110 

0100100111 
0100101111 



Compare less than or equal byte 
Compare less than byte 

Compare less than or equal halfword 
Compare less than halfword 

Compare less than or equal word 
Compare less than word 

Compare less than or equal longword 
Compare less than longword 



Description : 

The scalar registers Sk and Sj are unsigned compared. The 
result is loaded into SC. If the comparison is .TRUE., then SC 
is set to 1. If the comparison is .FALSE., then SC is reset 
to 0. 
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Notes: 



1 Unsigned compares for equality are performed using 
the signed compares . 
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COMPARE SCALAR/IMMEDIATE 



(le|lt|eq) . (h|w|s) #N,Sk 



Purpose : 

To signed compare a scalar and an immediate and load SC 

Format: 

| Opcode |L| 001 | Sk | | N | 

15 7 6 5 3,2 31 | 16 

Operation: 

IF (Ak .OPCODE-TEST. Immediate) THEN 

SC=1 
ELSE 

sc=o 

ENDIF 



PSW: 



SC is affected (see above) . 



Exceptions : 

Reserved Operand (floating point only) . 



Opcode : 



le.h #N,Sk 000111100 Compare less than or equal halfword 

lt.h #N,Sk 000111110 Compare less than halfword 

eq.h #N,Sk 000110110 Compare equal halfword 

le.w #N,Sk 000111101 Compare less than or equal word 

lt.w #N,Sk 000111111 Compare less than word 

eq.w #N,Sk 000110111 Compare equal word 

le.s #N,Sk 000110100 Compare less than or equal single 

lt.s #N,Sk 000110101 Compare less than single 



Description : 

The scalar register Sk and the immediate are signed compared. 
The result is loaded into SC. If the comparison is .TRUE., 
then SC is set to 1. If the comparison is .FALSE., then SC is 
reset to 0. 
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Notes : 

The signed compare for equality for single precision float is 
NOT provided. Reserved operand detection for equality can be 
achieved by performing a compare word immediate with the 
immediate being a reserved operand. Then a compare word 
instruction can be used. 
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COMPARE SCALAR/IMMEDIATE UNSIGNED 



(le|lt)u. (h|w) #N,Sk 



Purpose : 

To unsigned compare a scalar and an immediate. 

Format: 

| Opcode |L| 001 | Sk | | N | 

15 7 6 5 3,2 31 | 16 

Operation : 

IF ( Immediate .OPCODE -TEST. Sk) THEN 
C = 1 



ELSE 
ENDIF 



C = 



PSW: 



SC is affected (see above) 
Exceptions : 
Opcode : 



leu.h #N,Sk 
ltu.h #N,Sk 

leu.w #N,Sk 
ltu.w #N,Sk 



000111000 
000111010 

000111001 
000111011 



Compare unsigned less than or equal halfworci 

Compare unsigned less than halfword 

Compare unsigned less than or equal word 

Compare unsigned less than word 



Description : 

The scalar register Sk and an immediate are unsigned compared. 
The result is loaded into SC. If the comparison is .TRUE., 
then SC is set to 1. If the comparison is .FALSE., then SC is 
reset to 0. 



Notes : 



1 Compare equal unsigned is performed using the com- 
pare signed instruction. 
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CONVERT SCALAR/SCALAR 



cvt Sj,Sk 



Purpose : 



To convert the integer or floating point contents of one S 
register to an integer or floating point value of different 
precision. 



Format: 



| Opcode 
15 



| Sj | Sk | 
6,5 3,2 



Operation : 

Sk = Convert (Sj) 

PSW: 

SIV = Integer Overflow 
OV = Exponent Overflow 
RO = Reserved Operand 

Exceptions : 

Integer Overflow 
Exponent Overflow 



! according to the opcode 



Opcode : 



cvtw.s Sj,Sk 
cvts.w Sj,Sk 
cvtd.s Sj,Sk 
cvts.d Sj,Sk 

cvtw.b Sj,Sk 
cvtw.h Sj,Sk 
cvtb.w Sj,Sk 
cvth.w Sj,Sk 

cvts .1 S j , Sk 
cvtd.l Sj,Sk 
cvtl.s Sj,Sk 
cvtl . d S j , Sk 

cvtl.w Sj,Sk 
cvtw.l Sj,Sk 



0100001000 Convert word to single float 

0100001001 Convert single float to word 

0100001010 Convert double float to single float 

0100001011 Convert single float to double float 

0100000100 Convert word to byte 

0100000101 Convert word to halfword 

0100000110 Convert byte to word 

0100000111 Convert halfword to word 

0100001100 Convert single float to longword 

0100001101 Convert double float to longword 

0100001110 Convert longword to single float 

0100001111 Convert longword to double float 

0100010100 Convert longword to word 

0100010101 Convert word to longword 



Description: 

The specified integer or floating point operand is converted 
to specified destination data type and the result is loaded 
into Sk. An overflow exception can occur for the word to byte 
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and word to halfword instructions. Bytes and half words are 
sign extended to words. Conversions from float to fix use 
truncation (rounding towards 0) as the rounding algorithm. 



Notes ; 



The other possible signed conversions are imple- 
mented by executing two of the provided convei — 
sions. Halfword to byte is performed by cv.hw and 
cv.wb. Byte to halfword is performed by cv.bw and 
cv . wh . 

Unsigned conversions may be implemented using logi- 
cal AND instructions. 

Only the specified precision of the destination 
operand is modified. All other bits are unchanged. 
If an input operand is a floating point reserved 
operand, then the destination is unchanged, and the 
R0 and SIV flags are both set to 1. 

Truncation from float to fix follows the FORTRAN 
standard. Thus, -5.9 is truncated to -5, and 5.9 
is truncated to 5. 

Conversion from integer to floating point types is 
performed thus: 

a. The fixed point number is normalized. 

b. The most significant 24 bits (for single) or the 
most significant 53 bits (for double) of the nor- 
malized fixed point number become the fraction of 
the result. The lower order bits of the normalized 
fixed point number are truncated. 
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LOGICAL SHIFT SCALAR/SCALAR shf s j , Sk 



Purpose : 

To logically shift a scalar. 

Format: 

| Opcode | Sj | Sk | 

IS 6,5 3,2 O 

Operation: 

Sk = Shift Sk by Sj<7..0> 

PSW: 

Exceptions: 

Opcode : 



shf Sj,Sk O1O1OO0101 Shift a scalar 

Description: 

The contents of Sk are shifted according to the contents of 
Sj. When Sj is positive, Sk is shifted left. When Sj is nega- 
tive, Sk is shifted right. All 64 bits of Sk participate in 
the shift. Vacated positions are zero filled. Only Sj<7. .0> 
are used to control the shift. Sj<63..8> are ignored. 



Notes : 



Arithmetic shifts are implemented using multiplies and 
divides . 
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TRAILING ZERO COUNT tzc Sj,Sk 



Purpose: 

To determine the number of trailing zeros in a scalar regis- 
ter. 

Format: 

| Opcode | Sj | Sk | 

15 6,5 3,2 

Operation : 

Sk = Trailing-zero-count (Sj) 

Exceptions : 

Opcode : 

tzc Sj,Sk 0100010111 Count of trailing zeroes in Sj 

Description : 

The number of trailing zeros in Sj counting from bit through 
bit 63 are loaded into bits<6..0> of Sk. The tzc instruction 
searches for the first binary 1 from right to left, the same 
way bits are numbered. If Sj is all 0, the number 64 is 
loaded into Sk<63..0>. Otherwise, if bit n is set, then the 
binary value of n is loaded into Sk<63..0>. 
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POPULATION COUNT SCALAR 



plc.t Sj,Sk 



Purpose : 

To determine the number of l's in a scalar register. 

Format: 

| Opcode | S j | Sk | 
15 6,5 3,2 

Operation: 

Sk = 

DO 10 a = 0, 63 

IF (Sj<a> .EQ.l) THEN 
Sk =Sk + 1 

ENDIF 
10 CONTINUE 

PSW: 

Exceptions : 

Opcode : 

plc.t Sj,Sk 0100010110 Count the number of l's in Sj 

Description : 

The number of l's in Sj are loaded into bits<6..0> in Sk. 
All other bits of Sk are reset to 0. 

Notes : 
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MOVE SCALAR/ADDRESS n, ov Sj,Ak 



Purpose : 

To move a scalar register into an address register. 

Format : 

| Opcode | Sj | Ak | 

15 6,5 3,2 

Operation : 

Ak = Sj<31. .0> 

PSW: 

Exceptions : 

Opcode : 



mov Sj,Ak 0101000011 Move 32 bits of Sj into Ak. 

Description : 

Move the least significant 32 bits of Sj into Ak. The most 
significant 32 bits of Sj are ignored. 

Notes : 
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MOVE ADDRESS/SCALAR mov Aj , Sk 



Purpose : 

To move an address register into a scalar register. 

Format: 

| Opcode | Aj | Sk | 

15 6,5 3,2 

Operation : 

Sk<31. .0> = Aj 

Sk<63..32> = Sk<63...62> ! unchanged 

PSW: 

Exceptions : 

Opcode : 

mov Aj,Sk 0101000111 Move an address to a scalar 

Description : 

The Aj address register is moved into the least significant 32 
bits of Sk. The most significant 32 bits of Sk are unchanged. 

Notes : 
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MOVE SCALAR/SCALAR 



mov. (w| 1 |s |d) Sj,Sk 



Purpose : 



To move one scalar register to another 



Format: 



Opcode 



15 

Operation: 

Sk = Sj 
Sk<31. .0> 

PSW: 



Sj 



Sk 



6,5 3,2 

! longword 
Sj<31. .0> 



! word 



Exceptions : 



Opcode s 



mov.l Sj,Sk 
mov . w S j , Sk 
mov . d S j , Sk 
mov . s S j , Sk 



0101000110 Move scalar register longword 

0101000100 Move scalar register word 

0101000110 Move scalar register single float 

0101000100 Move scalar register double float 



Description: 

The contents of the specified portion of scalar register Sj 
are moved to Sk. Sj and any unreferenced portion of Sk remain 
unchanged . 



Notes : 



1 The . s and .w forms of this instruction are 
equivalent, as are the .d and .1 forms. The .s and 
.d forms are added for convenience. 
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CHAPTER 11 
ll Eragram Control Instruct icn Ss£ 



The instructions defined in this chapter alter the program counter (PC) . 
Alterations of the program counter may occur as a result of a branch per- 
formed after a comparison, an unconditional jump, a subroutine call or 
return, or an operating system call or return. A return from interrupt 
processing is also defined. 



11 • l Branches/ Jumps 

These instructions include Branch on PSW Bit, Jump, and Breakpoint. The 
branch instructions provide a convenient way to branch relative to the 
present value of the PC. All branch instructions are 16-bits in length. 
Jump instructions load the program counter with an effective address. The 
effective address is developed as an operand address. Only Breakpoint 
affects the flags in the PSW, as follows: 

FRL, C, AIV, ADZ, SC, SIV, SDZ, OV, UN, FDZ, RO = 

When the PC increments to reference the next sequential instruction or a 
new value is loaded into the PC, the PC is altered in the following way for 
branches and jumps. If the current ring is 4, bits<30. .1> are loaded. If 
the current ring is not ring 4, blts<28..1> are loaded. In all cases, the 
branch or jump is constrained to be within the current ring. 



11.2 Subroutine Call/Sayg/Return 

This section describes the facilities provided for subroutine 
call/save/return. The general strategy is to provide the following: 

1 An efficient mechanism in both time and space. 

2 Flexibility in supporting precompiled argument packets or argu- 
ments pushed onto a stack. 

3 Passing arguments by reference (a byte pointer) or by value. 

4 Support of the notion of a common run-time environment and link- 
ing subroutines written in one or more languages. 

With these objectives in mind, the following facilities and/or methodolo- 
gies are supplied. 

1 An architecturally defined stack pointer, argument pointer, and 
frame pointer within the address register space. 

2 Three call instructions: call, calls, and callq. The calls 
instruction pushes minimal machine state. The call instruction 
pushes all the scalar machine state. FRL in the saved (pushed) 
PSW denotes the amount of machine state pushed. A third, callq, 
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is used for local subroutines that do not require the definition 
of a new stack frame. 
3 A set of instructions which push and pop registers from the 
stack. Also an instruction which pushes effective addresses onto 
the stack. 

The following instructions could be generated for the two most likely sub- 
routine invocation sequences: precompiled argument packets, and arguments 
pushed on the stack. 

Precompiled Arguments 

1 Ldea (the effective address) into AP the address of the packet. 

2 Call the subroutine. 

3 Rtn instruction executed in called subroutine's space. 

4 Load the former AP from the stack (the effective address) . 



Arguments Pushed onto the Stack 

1 Push arguments onto the stack. 

2 Move SP to AP (address of the argument packet in the stack) . 

3 Call the subroutine. 

4 Rtn instruction executed in the cal lee's space. 

5 Clear off the pushed arguments by an add.w,#(n|N) ,Ak instruc- 
tion . 

6 Load the former AP from the stack. 



11.3 Stack Structure/Return Blocks 

The structure of the stack for subroutine entry and exits is described 
below (where LSI means Language Specific Information) : 

There are 4 types of return blocks: 

* Short 

* Long 

* Extended 

* Context 



A short return block is formed as a result of executing a calls instruc- 
tion. The return address, psw, A7, and A6 are saved. (FRL<1 . . . 0>=11) . 

A long return block is formed as a result of executing a call instruction. 
The return address, psw, A7, A6, AS through Al, and S7 through SI are 
saved. AO and SO are not saved. (FRL<1 . . . 0>=10) . 

An extended return block is formed as a result of: system call (sysc) , 
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trap, or a breakpoint. The return block contains the return address, psw, 
all the A registers, and all the S registers. The stack pointer saved (AO) 
references the value of AO PRIOR to the saving of the extended return 
block. (FRL<1. . .0>=01) . 

A context block may be formed as a result of a system exception. The con- 
text block contains an extended return block plus internal machine state. 
This internal state will change for each implementation. A context block is 
pushed on the ring process stack . (FRL<1 . . . 0>=00) . 

Returning from each of these return blocks is implemented as follows: the 
rtn instruction is used for the short, long and extended return blocks. 
The rtnc instruction is used to return from system exceptions that save a 
context block. The structure of the stack is shown in Figure 11-1. 



11.4 Quick Calls and Return 

Instructions are provided for subroutine entry that only save and restore 
the program counter. The PSW and frame pointer are unaltered. The callq 
instruction pushes the address of the next instruction and branches to the 
subroutine. The rtnq pops the program counter value on top of the stack. 



11.5 System Call and Return 

These instructions are used to perform system calls and system returns, and 
perform ring crossings in a protected manner. The System Call instruction 
is used to perform the call, and the flags in the PSW are cleared to all 
zero. To perform a system return, the standard return instruction (rtn) is 
used, and the flags are loaded from the return block. 
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Figure 11-1: 
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BRANCH ON PSW BIT 



br | nop 



Purpose: 



To conditionally or unconditionally perform 
branch. 



a short PC relative 



Format: 

| Opcode | Displacement | 

15 8,7 

Operation : 

IF (test .EQ. .TRUE.) THEN 

PC = PC + sign-extended-displacement 
ELSE 

PC = PC+2 
END IF 



PSW: 



unchanged 




Exceptions : 




Opcode : 




nop 


01110000 


br 


01110001 


bri . f 


01110010 


bri.t 


01110011 


bra. f 


01110100 


bra. t 


01110101 


brs. f 


01110110 


brs.t 


01110111 



No Operation 

Branch Always 

Branch on ION false 

Branch on ION true 

Branch on address carry false 

Branch on address carry true 

Branch on scalar carry false 

Branch on scalar carry true 

Description : 

The specified condition is tested. If the tested condition is 
asserted, the 8-bit displacement is sign extended to 31 bits. 
These 31 bits are then added to bits<31..1> of the program counter. 

The program counter value added to contains the address of the 
branch instruction. If the tested condition is not asserted, then 
the next sequential instruction is executed. 
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Notes : 



These instructions are used for short PC relative 

branches. All branches are constrained to stay within 

the current ring (indicated by bits<31..29> of the PC) . 

Additional instructions exist to jump to any arbitrary 

instruction. See the instruction. 

The range of the branch instruction is +127 to -128 

half words, or +254 to -256 bytes. 

The displacement is unused for the instruction. 



Section 11.5 



11-6 



Program Control Instruction Set 
JUMP 



j mp | exit 



Purpose : 



To conditionally/unconditionally jump to an address. 



Format: 



| Opcode |@|L| Aj | Ak | 
15 8,7,6,5 3,2 



| Displacement | 
(31,15) 



Operation: 

IF (OPCODE -COND .EQ. .TRUE.) THEN 

PC = Effective Address 
ELSE 

PC = PC + Instruction Length 
END IE 



PSW: 



Unchanged 



Exceptions : 








Opcode : 








exit 






000000000 


jmp 




<effa> 


000000010 


jmpi. 


,f 


<effa> 


000000100 


jmpi. 


,t 


<effa> 


000000110 


jmpa, 


,f 


<effa> 


000001000 


jmpa. 


,t 


<effa> 


000001010 


jmps. 


,f 


<effa> 


000001100 


imps, 


,t 


<effa> 


000001110 



Error Exit Instruction 

Jump Always 

Jump on ION false 

Jump on ION true 

Jump on address carry false 

Jump on address carry true 

Jump on scalar carry false 

Jump on scalar carry true 

Description : 

The specified condition is tested. If the condition is asserted, 
the Program Counter is loaded with the effective address. If the 
condition is not asserted, the Program Counter is updated to refer- 
ence the next sequential instruction. 



For the error exit instruction, a system call to 
address C (hex) is performed. A class code of 
byte 2 of A5. Zero is loaded into byte 3 of AS. 



ring 0, byte 
is loaded into 



Notes : 



All jumps (except exit) are constrained to be within 
the current ring. That is, if the current ring is 4, the 
most significant bit of the effective address is 
ignored. If the current ring is 3,2,1, or the most 
significant 3 bits of the effective address are ignored. 
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The error exit instruction is an instruction whose 
opcode is all 0. It detects a common programming error 
(transfer to a page that was initialized to all by the 
OS) . 

For the error exit instruction, the L (length) bit is 
interpreted. This means that the error exit can have a 
16- or 32-bit displacement. The @ bit (bit 7) is not 
interpreted; thus, indirection can never occur for the 
error exit instruction. 
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BREAKPOINT bkpt 



Purpose: 

To jump to a debugger via a breakpoint call. 

Format: 

| Opcode | Ak | 

15 3,2 



Operation: 

An extended call is performed to the address contained in byte 
address 50 (hex) of the current ring. 



PSW: 



FRL= 00 
C,AIV,ADZ,SC,SIV,SDZ,OV,UN,FDZ,RO = 



Exceptions : 

Opcode : 

bkpt 0111110101010 Breakpoint 

Description: 

A subroutine call is executed. The cal lee's address is the current 
ring, byte address 50 (hex) . An extended return block is pushed on 
the user stack. Thus all the A and S registers are pushed onto 
the current stack. The PC saved in the return block references the 
instruction immediately following the bkpt instruction. 



Notes: 

1 The length of the bkpt instruction is 1 half word. Thus a 
bkpt instruction can be substituted for any instruction 
in the CONVEX architecture. 

2 The Ak field is not used. 
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CALL A SUBROUTINE (call|calls) <effa> 



Purpose : 

To call a subroutine. 

Format: 

| Opcode |@|L| Aj | Ak | | Displacement | 

15 8,7,6,5 3,2 (31,15) 

Operation : 

IF (CALLS-OPCODE) THEN 

PSW-FRL-BITS = 11 ! short frame 
ELSE 

PSW-FRL-BITS = 10 ! long frame 
END IF 

IF (CALL-OPCODE) THEN ! long frame 
PUSH SI . . S7 
PUSH A1..A5 !SP is not updated 

ENDIF 

PUSH A6 ! short and long frame 

PUSH A7 

PUSH PSW 

PUSH next instruction address 

PSW(FRL,C, SC,AIV,ADZ,UN,OV,FDZ,RO,SIV,SDZ)=0 

IF (CALLS-OPCODE) THEN 

AO = AO -16 ! short frame 
ELSE 

AO = AO -92 ! long frame 
END IF 

A7 = AO 

PC = Effective Address 



PSW: 



PSW<FRL>=00 

PSW<C, SC, AIV, ADZ, UN, OV, FDZ, RO, SIV, SDZ>=0 

Exceptions : 

Opcode: 

call <effa> 001000000 Call a subroutine, make a long frame 
calls <effa> 001000010 Call a subroutine, make a short frame 
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Description : 

A new stack frame is created. A short, frame is created for the 
calls instruction. A long frame is created for the call instruc- 
tion. SO is never saved. The FRL bits in the PSW saved indicate 
the frame size created. AO and A7 are updated to reference the new 
top of stack. The effective address of the call instruction then 
becomes the new value of the program counter. The trap enable bits 
of the PSW are propagated from the caller to the callee (i.e., left 
unchanged) . Thus, if the caller had floating point overflow traps 
enabled, the callee also has floating point trap enabled. The 
status bits of the callee 's PSW are reset to 0. 



Notes : 



1 A local area for variables is created by executing a 
sub.w #N,SP on entry to the called routine. 

2 The previous values of the frame and argument pointers 
are saved on the stack. 

3 Arguments are referenced with positive displacement from 
an argument pointer if defined by software convention. 

4 SO is propagated but not saved. Typically, SO is used 
to hold return values of functions (by software conven- 
tion) . . _ . . 

5 AO, the stack pointer, is not saved. AO, however, is 
updated to reference the new top of stack. 

6 All A and S registers (except AO and A7 -- these refer- 
ence the new top of stack) are propagated through the 
call. 

7 The Ak field is unused. 

8 The FRL bits in the PSW register are always reset to . 
To determine the type of frame created, the FRL bits of 
the PSW in the saved frame must be examined. 
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RETURN FROM SUBROUTINE rtn 



Purpose : 

To return from a subroutine. 



Format: 



I Opcode | Ak | 

15 3,2 

Operation: 

SP = FP ! remove local save area 

Unwind stack to previous frame (restore stack built by 

call, calls, sysc, or exception condition handler) . If 

this is an outward return, then the SP after the pop is 

stored into bytes <72..75> of page of the ring containing the rtn 

instruction . 



PSW: 

Load from current stack frame 

Exceptions : 

Ring Violation; Inward Return 

Ring Violation; Invalid Frame Length 

Opcode: 

rtn 0111110010010 Return from subroutine call 

Description : 

An exit from a subroutine is performed. FP is moved to SP to rees- 
tablish the top of the stack. It is assumed that the subroutine 
was entered using a call, calls instruction, sysc (system call), or 
an exception condition that pushed an extended return block. 

There are 4 types of return blocks: short, long, extended, and con- 
text. Short and long, are created as a result of a subroutine 
call; extended and context are created as a result of a system 
call, fault, or trap. When FRL=00 (indicating a context block), an 
rtnc (return from context) must be executed. When a ring crossing 
is encountered (as indicated by an extended return block and a 
saved PC whose ring field is not the current ring) , then the fol- 
lowing occurs. First, a check for outward ring crossing is made. 
If the ring crossing is inward, a system exception condition 
occurs. If outward, the extended return block is popped. The 
stack pointer after the pop is stored in bytes <72..75> of page 
of the ring containing the rtn instruction. 
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Notes : 



1 The FRL field from the PSW in the stack indicates the 
type of return block saved: ll=short, 10=long, 
01=extended, and O0=context. 

2 Ak is not used in this instruction. 

3 The rtnc instruction must be used to return when ERL=00, 
indicating a context block. 
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PUSH PROGRAM COUNTER 



callq <effa> 



Purpose : 



To push the PC onto the stack and branch. 



Format: 



| Opcode |@|L| Aj | Ak | 

15 8,7,6,5 3,2 

Operation: 

TEMP = PC-NEXT- INSTRUCTION 
PC = Effective Address 
PUSH TEMP 

PSW: 

Exceptions : 

Opcode : 



| Displacement | 
(31,15) 



callq <effa> 



001000100 



Push the program counter and jump 



Description: 

The address of the instruction immediately following the callq 
instruction is pushed onto the stack. The PC is then loaded with 
the calculated effective address. 



Notes : 



This instruction is used as a fast subroutine call when the current 
address context need not be altered. The rtnq instruction is used 
to return from the callq instruction. 
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POP PC and JUMP rtnq 



Purpose : 

To pop the top element of the stack and load the PC. 



Format: 




| Opcode 


1 Ak | 


IS 


3,2 


Operation: 




Temp = c(AO) 




AO = AO + 4 




IF (PC<31> = 


1) THEN 


PC<30 . . 0> 


= TEMP <30. .0> 


ELSE 




PC<28..0> 


= TEMP <28. .0> 


ENDIF 




r&w : 
Exceptions : 




Opcode : 





rtnq 0111110010000 Pop the program counter and jump 

Description: 

The top of the stack contains a previously pushed PC value. The PC 
is popped into the present PC, the stack is adjusted, and execution 
continues at the instruction referenced by the popped PC. 



Notes : 



The Ak field is not used. 

The current ring of execution does not change. 
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SYSTEM CALL sysc #r<#g 



Purpose: 

To perform a system call to ring #r. 

Format : 

| Opcode | L | 000 | Ak | | #r | | #g | 

15 7 6 5 4 3 31 29 28 16 15 

Operation : 

An inward or current ring crossing with protection checks is per- 
formed. Temp = PC+instruction length, where PC is the address of 
the sysc instruction. 

If the call is to the current ring, then the current SP is used. 
If the call is to an inner ring (#r) , then the stack for ring #r is 
used. The SP contained in page of the inner ring is used to 
define the base of the inner ring stack. 

FRL<1..0> = 01 ! indicates an extended frame 

Execute a call to ring #r, GATE_ARRAY (#g) . 

Get new stack pointer (as described above) . 

SP = SP - 104 ! Extended return block 

FP = SP 

PC<31. .29> = #r 

PC<28 . . 1> = GATE_ARRAY (#g) 



PSW: 

Cleared to all zero 

Exceptions : 

Ring Violation; Outward call 
Ring Violation; Invalid Gate 

Opcode : 

sysc #r,#g 000100001 Perform a system call 

Description : 

The #g field is used to reference an entry in a gate array. The #r 
field is the number of the called ring. The base of the gate array 
is referenced by bytes <76..79> of page zero of the ring named by 
#r. 

After the new ring stack is established, an extended return block 
is pushed. All of the A and S (including SO and A0) registers are 
saved, as well as the PSW and the address of the instruction 
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following this instruction (which is the system call instruction) . 

The #g field is used to obtain the address of the called instruc- 
tion for both intra (same ring) or inward ring crossing. 



Notes : 



1 The Ak field is not used. 

2 See the protection chapter for a more detailed explana- 
tion of stack switching and the structure of the gate 
array. 

3 The stack pointer saved in the extended return block 
references the top of stack of the caller's ring. 

4 If the ring to be called is 0, and the gate entry is 
less than 32,768/ then the immediate length can be 
specified as 16 bits. 

5 If L in the instruction is 0, the 16-bits immediately 
following the opcode are sign extended to 32 bits. 
These 32 bits are interpreted as the #r, #g fields. 
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CHAPTER 12 
12 EriYilssrsd Cantral/Siatus Instrustian Set 



This chapter contains the definition of the privileged instruction set. A 
privileged instruction is like any other CONVEX instruction with one 
difference: the current ring of execution must be 0. This means that 
bits<31..29> of the program counter must be all 0. Privileged instructions 
are used by the operating system kernel to control process multiplexing, 
virtual address space management, and system clock stores. 

Other instructions are also included in this chapter. These instructions 
generally are used by parts of the extended operating system contained in a 
ring other than 0. 

An attempt to execute a privileged instruction in a ring other than 
results in a system exception and the generation of a system call through 
byte address 0C (hex) of page of ring 0. 



12 . 1 Clocks 

A series of clocks are available to the operating system and user. These 
clocks are used for: 

1 Scheduling 

2 Maintaining a time chronometer 

3 Maintaining an alarm 

4 Measuring the time it takes for a program to execute. 

The clocks associated with items 2 and 3 are maintained external to the job 
processor. By addressing main memory locations appropriately, the time of 
day can be determined. 

Supporting items 1 and 4 are a 3 2 -bit interval timer register and 2 associ- 
ated support registers. Manipulation of all three of these registers is 
privileged. 

The 3 32 -bit registers are the: 

* Next Internal Timer Counter (NITC) 

* Interval Timer Counter (ITC) 

* Interval Timer Status Register (ITSR) 

The format of the NITC (Next Interval Timer Count) is: 
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I | Next Interval Timer Count | | 
31 28,27 8,7 

When ITC overflows, NITC is loaded into ITC to resume counting. 
The format of the ITC (Interval Timer Counter) is: 



I | Interval Timer Counter | 
31 28,27 8/ 7 



The ITC is a 32 -bit register that is incremented every microsecond in bit 
8. It is loaded from NITC when ITC becomes all 0. At that time an inter- 
rupt may also be generated (as controlled by ITSR) . 

The format of the ITSR in read mode is: 



|ON|INE|FULL|OVF| Reserved | 

31 30 29 28 27 o 

where the ITSR bits have the following meanings: 

* Bit 28 - OVF. When ITC overflows and if FULL is already set, 
then OVF is set to one. This condition occurs when 2 or more 
interval timer overflows occur without an interrupt being pro- 
cessed. OVF is cleared to when a value is loaded into ITSR. 

* Bit 29 - FULL. When ITC overflows, FULL is set to 1. If INE 
is also set to 1, then an interval timer interrupt occurs (vec- 
tors through byte address 10 (hex) of ring 0) . FULL is reset 
to when a value is loaded into ITSR. 

* Bit 30 - INE. When INE is set to 1 and bit 29 (Full) is set to 
1, an interval timer interrupt (vectors through byte address 10 
(hex) of ring 0) occurs. When INE is 0, no interrupt occurs. 

* Bit 31 - ON. When ON is reset to 0, ITC does not increment. 
When ON is set to 1, ITC increments every microsecond. 

For write mode, the ITSR will operate as follows: 
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31 30 29 28 
| ON | INE | DEC | --- 



The FULL and OVFL bits act as a pseudo two-bit counter, incremented by an 

overflow of the timer counter, and decremented by a write to the ITSR with 
DEC (bit 61) set to a one. OVF and FULL count as follows: 

Before WRITE After WRITE (DEC =1) 

EHLL QYE EJZLL QYE 



10 

11 10 



Note that two writes with DEC=1 will clear both FULL and OVF. A write to 
ITSR with DEC=0 will not affect the FULL and OVF bits. 

The interval timer interrupt handler should execute a store into the ITSR 
to clear FULL. An interval timer interrupt is masked out by either INE 
being reset to in the ITSR or by the ION flag being set to 1. When all 
three interval timer registers are concurrently moved to/from a scalar 
register, the format of Sk is: 



| ITSR | NITC | RES | ITC | RES | 
63 60 59 40 39 28 27 8 7 



12.2 Instruct ion Set 

The instructions defined in this chapter are: 
dsi 
eni 
ldsdr 
ldkdr 
patu 
pate 
rtnc 
pich 
plch 
Load ITR 
Store ITR 
Load ITSR 
Load CPUID 
xmti 
mski 
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halt 

mov Sk,W 

tstvv 
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LOAD PROCESS SDRs ldsdr Ak 



Purpose : 

To load values into the process SDR's. 

Format: 

| Opcode | Ak | 

15 3,2 

Operation : 

SDR(1 / 2, . . . ,7) = Effective Address (Ak) ! memory data must be 
resident and word aligned. 

PSW: 

Exceptions : 

Ring Violation; Privileged Instruction 

Opcode : 

ldsdr Ak 011111O0O0OOO Load process SDR's 

' Description : 

Ak contains the effective address of a 7 entry table. Each entry is 
a word. Entry 1 contains a value to be stored into SDR1. Entry 2, 
at (Ak) +4, contains a value to be stored into SDR2, and so on. Upon 
completing the store of the 7 table entries into SDR (1:7), the ATU, 
logical cache, and instruction cache are purged of any entries 
associated with segments 1-7. 

Notes : 

1 The data located in main memory to be loaded into the 
SDR's must be resident to ensure that an address trans- 
lation fault does not occur and must be aligned on 32- 
bit words. If the data are not resident or aligned, a 
machine exception occurs. 

2 The data located in main memory to be loaded into the 
SDR's must be word aligned on a 32 -bit boundary. If 
they are not, a machine exception occurs. 
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LOAD KERNEL SDRs ldkdr ^ 



Purpose: 

To load values in all 8 SDR's. 

Format: 

| Opcode | Ak | 

15 3,2 

Operation : 

SDR (0,1 7) = Effective Address (Ak) 

PSW: 

Exceptions : 

Ring Violation; Privileged Instruction 

Opcode : 

ldkdr Ak 0111110000001 Load all 8 SDR's 

Description : 

Ak contains the effective address of an 8 entry table. Each entry 
is a word. Entry is stored in SDRO. Entry 1 is stored into SDR1 
and so on. (Ak) references the table entry 0. (Ak) +4 references 
table entry 1. Upon completing the store of all 8 entries into 

SDR(0,1 7), the ATU, logical, and instruction caches are 

purged. If, prior to ATU purging, logical addressing equaled physi- 
cal addressing, then the ATU is enabled. Thus, all subsequent 
addresses are virtual . 



Notes i 



1 The data located in main memory which are to be loaded 
into the SDR's must be resident to ensure that no 
address translation fault occurs. If the data are not 
resident, a machine exception occurs. 
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2 The data located in main memory which are to be loaded 
into the SDR's must be word aligned on a 32-bit boun- 
dary, or a machine exception results. 

3 The addressing environment must be physical (the ATU 
must be turned off) . If the ATU is turned on, a machine 
exception occurs . 

4 Memory data must be resident and word aligned. 
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PURGE ATU 



patu 



Purpose : 

Purge the entire ATU. 

Format : 

| Opcode | Ak | 

15 3,2 

Operation: 

ATU_valid_bits = 
Purge L_cache 
Purge I_cache 

PSW: 

Exceptions: 

Ring Violation; Privileged Instruction 

Opcode : 

P atu 0111110000100 Purge the entire ATU 

Description: 

All the entries in the ATU are purged, which means that any virtual 
addresses that were encached are invalidated. The logical and 
instruction caches are also purged. 

Notes : 

1 The Ak field of the instruction is unused. 
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PURGE ATU ENTRY pate Ak 



Purpose : 

To purge an ATU entry. 

Format: 

| Opcode | Ak | 

15 3,2 



Operation : 

ATU_valid_bit (Ak) = 
Purge L_cache 
Purge I_cache 



PSW: 



Exceptions : 

Ring Violation; Privileged Instruction; class=8, qualifier=0 

Opcode : 

pate Ak 0111110000101 Purge ATU entry 

Description : 

Ak contains a virtual address. If there is an ATU entry associ- 
ated with the address in Ak, that ATU entry is purged (i.e., marked 
invalid) . All other ATU entires are left unchanged. 



Notes : 



1 This instruction is typically used when a pageframe is 
added to the working set of a process after an address 
translation fault. The page associated with the fault is 
found in physical memory but is not part of the 
process * s working set . Thus no I/O request is necessary 
to resolve the fault. 
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RETURN FROM CONTEXT BLOCK rtnc 



Purpose : 

To return from a context block. 

Format : 

| Opcode | Ak | 

15 3,2 

Operation: 

Pop context block using AO. 

Processor State = Context Block (ERL=00) . 

AO = AO + context block size 

Ring 0, bytes <36..39> = AO 

AO = stack pointer from context block. 

PSW: 

Exceptions : 

Ring Violation; Privileged Instruction 

Opcode : 

rtnc 0111110010101 Return from a context block 

Description : 

The context block on the ring stack is popped; then the new value 
of the stack pointer after the pop, A0, is stored into the context 
stack pointer in bytes <36..39> of ring 0. Finally, the stack 
pointer value contained within the context block just popped is 
loaded into A0. 



Notes : 



1 The entire processor state was stored in the context 
block at the time that the condition that initiated the 
exception occurred. The cause of the exception was 
loaded into AS after the context block was stored. This 
permits the operating system to recover from the excep- 
tion condition, if possible. 

2 The Ak field of the instruction is unused. 
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3 The processor context block is the entire hardware 
machine state at the time of the system exception condi- 
tion. After the OS resolves the exception (if it can) , 
the machine state (the context block) is restored. Then 
the faulted instruction can resume execution. 

4 The paradigm for system exception conditions is identi- 
cal to kernel calls, and differs only in the size of the 
saved return block, and the stack used. 

5 The processor does not check the FRL bits in the PSW. 
They are assumed to be 00. 
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ENABLE/DISABLE INTERRUPTS 



(eni | dsi) 



Purpose: 

To enable or disable interrupts. 

Format: 

I Opcode | k | 

15 3,2 

Operation : 

ion = 1 ! eni ion = ! dsi 

PSW: 

Exceptions : 

Ring Violation; Privileged Instruction 

Opcode : 

eni 0111110101000 Enable interrupts, set ion to 1 

dsi 0111110101001 Disable interrupts, reset ion to 

Description : 

The ion flag is used to enable or disable interrupts. When ion is 
a 1, interrupts are enabled. When ion is a 0, interrupts are dis- 
abled. 

The next sequential instruction is always executed even though 
interrupts may be pending when ion is set to 1. 

The ion flag enables or disables the interval timer, power fail, 
and the CPU virtual channel interrupts. 



Notes i 



1 Instructions exist to test the value of the ion. 

2 One additional instruction is always executed before any 
interrupts are taken. 
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PURGE INSTRUCTION CACHE pich 



Purpose: 

Purge the entire ICACHE. 



Format : 




| Opcode | Ak | 


15 3,2 





Operation : 




ICACHE_valid_bits = 




PSW: 




Exceptions : 




Opcode : 





pich O111110O0O110 Purge the instruction cache 

Description : 

All the entries in the instruction cache are purged. After the exe- 
cution of the pich instruction, the instruction cache is reloaded 
from main memory. 



Notes : 



1 The Ak field of the Instruction is unused. The pich 
instruction does not affect the results produced by the 
currently executing program, only its performance. The 
pich instruction is typically used by a language 
debugger to PURGE the Instruction Cache after a modifi- 
cation of instruction space is performed. 

2 This is NOT a privileged instruction. 
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PURGE LOGICAL CACHE 



plch 



Purpose: 

To purge the logical cache. 

Format : 



| Opcode | Ak | 


15 3,2 

Operation : 

LCACHE_valid_bits = 






PSW: 

Exceptions : 
Opcode : 



P lch 0111110000111 Purge the logical cache 

Description: 

All the entries in the logical cache are purged. After the execu- 
tion of the plch instruction, the logical cache must be reloaded 
from main memory. 



Notes : 



This plch instruction is used by a process that is con- 
currently performing I/O and executing another task 
(multi-tasking within a process) . The typical use of 
plch occurs when the pended task blocked or on I/O is 
completed and is subsequently dispatched and becomes 
running. As part of the dispatching mechanism, the log- 
ical cache must be purged. When process multiplexing 
occurs (by reloading the process SDR's), the logical 
cache is automatically purge. 

The plch instruction prevents previous data (stale data) 
from the pended task from being re-referenced when the 
pended task makes the transition to running. This 
instruction is only necessary for multiple tasks within 
the same process . 

This instruction is NOT privileged, and Ak is a don't 
care. 
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MOVE SCALAR/ITR mov Sk, ITR 



Purpose : 

To move Sk to the interval timer registers (nitc, itsr , itc) 



Format: 




| Opcode 


1 Sk | 


15 

Operation: 

nitc = Sk<59 . . 40> 
itsr = Sk<63. .60> 
itc = Sk<27. .8> 


3,2 



PSW: 

Exceptions: 

Ring Violation; Privileged Instruction 

Opcode: 

mov Sk,ITR 0111110001101 Load NITC, ITC, ITSR from Sk 

Description : 

The contents of Sk are used to load the next iteration count regis- 
ter, the iteration counter, and the iteration status register of 
Sk. 

Notes : 

1 For a description of the interval timer and its associ- 
ated registers, please see the introduction to Chapter 
12 in the COJCffiX Arcblfcactuce Handbook . 
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MOVE ITR/SCALAR mov ITR sk 



Purpose : 

To move the interval timer registers (itcnitc, itsr) to Sk 

Format: 

I Opcode | Sk | 

15 3,2 

Operation : 

Sk<59 . . 40> = nitc 
Sk<63..60> = itsr 
Sk<27. .8> = itc 

PSW: 

Exceptions : 

Ring Violation; Privileged Instruction; 01333=8, qualifier=0 

Opcode : 

mov ITR,Sk 0111110001100 Move the itc, itsr, nitc into Sk 

Description : 

The current value of the iteration counter, next iteration counter, 
and the interval timer status register are loaded into Sk. 

Notes : 

See the description of the interval timer and its associated regis- 
ters at the beginning of this chapter. 



Section 12.2 12-16 



Privileged Control/Status Instruction Set 

MOVE SCALAR/I TSR mov Sk,ITSR 



Purpose: 

To Sk<63 . . 0> to the ITSR register 

Format : 

| Opcode | Sk | 

15 3,2 

Operation: 

ITSR = Sk<63. .60> 

PSW: 

Exceptions : 

Ring Violation; Privileged Instruction; class=8, qualifier=0 

Opcode : 

mov Sk,itsr 0111110001111 Load ITSR with a scalar 

Description : 

The current value of itsr is loaded from the specified Sk. 

Notes: 

See the description of the interval timer and its associated regis- 
ters at the beginning of this chapter. 
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TRANSMIT INTERRUPT xmti sk 



Purpose: 

To interrupt a channel 

Format : 

I Opcode | Sk | 

15 3,2 

Operation : 

Assert the channel interrupt line of virtual channel c(Sk<7..0>) 

PSW: 

Exceptions : 

Ring Violation; Privileged Instruction 

Opcode: 

xmti Sk 0111110101101 Transmit Interrupt 

Description: 

An interrupt to the specified virtual channel is asserted. The 
least significant 8 bits of Sk (Sk<7. .0>) indicate which of the 256 
possible virtual channels is interrupted. 



Notes i 



1 Channels 0, 1, ...,7 are associated with the CPU. These 
CPU channels are referred to as CPU virtual channels. 
Interrupts to these channels are maskable; thus, a CPU 
can interrupt itself by referencing virtual channels 
(0,1,... ,7). Please see the mski instruction. 

2 Sk bits <63..8> are not used. 



Section 12.2 12 _ 18 



Privileged Control /Status Instruction Set 

MASK INTERRUPT mski Sk 



Purpose: 

Mask the virtual channels 

Format: 

| Opcode | Sk | 

15 3,2 

Operation: 

Mask out, individually, the virtual channels using Sk<7..0> 

PSW: 

Exceptions : 

Ring Violation; Privileged Instruction 

Opcode : 

mski Sk 0111110101100 Mask Out Interrupt 

Description : 

The least significant 8 bits of Sk are used as a mask. Bit masks 
out virtual channel 0, bit 1 masks out virtual channel 1, and so 
on- A inhibits the interrupt from a channel. A 1 enables the 
interrupt from the channel. Each channel can be individually 
masked out independently from the others. 

If concurrent interrupts are pending on multiple enabled virtual 
channels, then the interrupts are responded to in the following 
order : 

-- highest priority 
7 -- lowest priority 



Notes ; 



1 The operating system must explicitly perform mask outs 
upon interrupt service. This may require the saving of 
any previous mask values. 
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HALT 



halt #N,Ak 



Purpose : 

To halt the central processing unit. 



Format: 



| Opcode |L| 000 | Ak | | N | 

15- 6,5 3,2 31|16 

Operation : 

assert an I/O interrupt request 

Ak = immediate ! Ak is loaded to indicate the halt reason 
PSW: 

Exceptions : 

Ring Violation; Privileged Instruction 

Opcode: 

halt #N,Ak 000100000 Halt the central processing unit 

Description: 

The immediate field is loaded into Ak, and the central processing 
unit is halted. Further action is machine implementation depen- 
dent. 

Notes : 

This instruction is typically used for diagnostic and debugging 
purposes . 
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EXECUTE DIAGNOSTIC MICROCODE diag Ak 



Purpose : 

To execute a desired sequence of non-standard microcode 

Format: 



| Opcode | Ak | 

15 3,2 



Operation: 

Execute microcode sequence pointed to by the contents of Ak 

PSW: 

Exceptions : 

Undefined Opcode ! Ak does not contain valid operation code 
Ring Violation ! Privileged Instruction; 01838=8, qualifier=0 

Opcode : 

diag Ak 0111110111000 Execute non-standard microcode sequence 

Description: 

This instruction invokes one of a set of privileged instructions. 
Ak contains an opcode used by the microcode to jump to a desired 
sequence of microcode. Thus, while only one instruction opcode is 
used, multiple non-standard operations are accessible using this 
instruction. For a list of these operations, see below. An ille- 
gal opcode trap occurs if the contents of Ak are not supported by 
this instruction. 



1 enable lcache 

2 disable lcache 

3 store P_cache at (aS) 

4 load P_cache from (a5) 

5 store Addr_Trans_caches at (a5) 

6 load Addr_Trans_caches from (a5) 

11 store sdrs 0-7 at (a5) 

12 enable forced faults 

13 disable forced faults 

14 flush physical cache 

15 store hardware physical address contents 

16 load hardware physical address contents 

17 enable halt in rings 1-4 

18 disable halt in rings 1-4 
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AS is used as a logical address which references an area in memory. 
All loads and stores use A5 as a base address, incrementing A5 as 
successive addresses are accessed. 

Notes : 

1 This instruction is used by diagnostics to reference 
internal processor registers not accessible to the user 
program and is specific to the C-l implementation.- 
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MOVE SCALAR/W mov Sk,W 



Purpose: 

To move the least significant bit of Sk to the vector valid flag. 



Format: 
















1 


Opcode 






1 


Sk | 




15 








3, 


2 


Operation: 
IF 


(Sk<0> 
W 


= 


1) 

1 


THEN 






ELSE 














W 


= 










PSW: 

Exceptions : 

Ring Violation; Privileged Instruction 

Opcode : 

mov Sk,W 0111110101110 Move scalar to vector valid flag 

Description : 

The current ring of execution must be 0; otherwise, a privileged 
instruction exception occurs. If the current ring is 0, the least 
significant bit of Sk (bit<0>) is moved to the vector valid flag. 
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TEST VECTOR VALID tstvv 



Purpose: 

To test the value of the vector valid flag. 



Format : 




| Opcode 


1 Sk | 


15 


3,2 


Operation : 




IF (W = 1) THEN 




SC = 1 




ELSE 




SC = 





PSW: 

SC is affected (see above) . 

Exceptions : 

Opcode : 

tstvv 0111110101111 Test value of vector valid flag 

Description : 

The SC bit is loaded with the value of the vector valid flag. 



Notes ; 



1. Sk is unused. 

1. This instruction is not privileged. 

1. A brs instruction is typically used to determine the 

value of the SC bit after the execution of the tstvv 

instruction . 
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CHAPTER 13 

13 Vector/Scalar Instruction £<s£ 



13.1 Qyervisw 



This chapter defines the instructions which manipulate the vector accumula- 
tors (V), scalar accumulators (S) , and the vector merge (VM) register. The 
instructions which manipulate these registers are separate and distinct 
from the instructions which deal with the address registers. This distinc- 
tion permits overlapped execution of instructions, which then perform 
operations on these registers (V, S, and VM) , for increased performance. 
The vector/scalar instructions include Loads and Stores, Vector/Vector 
Arithmetics, Vector /Scalar Arithmetics, Vector/Vector Logical Operations, 
and Vector/Scalar Logical Operations. 

As with the A registers; the basic instruction set philosophy is for all 
memory operands to be loaded first into a scalar or vector accumulator; 
then a register to register instruction performs the specified operation. 

There are some additional features applicable to the S and V registers that 
are not applicable to the A registers. These features are: data types mani- 
pulated, chaining, functional unit reservation, register unit reservation, 
and register topology. 



13.1.1 Data Tyj^es 

There are 6 different data types; integer 8, 16, 32, and 64, referenced as 
byte, halfword, word, and longword, respectively; and single and double 
precision floating point (32 and 64 bit) . Logical data types are a special 
case of integer 64 bit. 



13.1.2 Vector Register. Ses 

As previously discussed in Chapter 3, "Register Set, " a vector accumulator 
can hold up to 128 elements, where each element can be up to 64 bits of 
precision. The VL register specifies the exact number of elements contained 
in the vector accumulator. VL applies to all vector accumulators. 

Note: when VL is zero, no vector operation is performed. 

Care should be taken when loading the VL register. When an operation is 
initiated, VL is copied into the internal machine state. Length control is 
generated from this internal VL value. However, when chaining is initiated, 
VL should not be changed until after the last chained instruction begins 
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executing. 

The first element in a vector accumulator ,i, is Vi (0) . If VL is less than 
128, then the Vi (0) through Vi(VL-l) elements are manipulated. All other 
elements are unimportant and are left unchanged. 

13.1.3 Chaining 

Chaining is a vector mechanism that permits the output of one vector 
instruction to be immediately used as the input to another vector instruc- 
tion. For example, the DOT product operation requires a sum (sigma) of a 
series of products. Chaining permits the sum to be initiated while the pro- 
ducts are being produced. This form of concurrency results in signifi- 
cantly higher performance. 

To facilitate chaining, vector register operations can specify up to three 
operands; two sources and one destination. This feature permits an output 
register to be different from either of the two possible input registers (3 
operand addressing) . The output register can then be used as an input for 
the second (chained) operation. 

The following is an example of chaining: 

V3 = V2 + VI 
V5 = V4 * V3 

In the above example, the output of the "+" that is stored into.V3 can be 
immediately used as an input to the "*" operation, which is essentially 
equivalent to executing the single statement: 

V5 = V4 * V3 = V2 + VI 

This single statement equivalent results in execution rates twice that 
achieved through sequential execution. 

Another example is: 

VI = Merge (VM,V0;S0) 
V3 = V2+V1 
V5 = V4*V3 

This executes as the single expression: 

V5 = V4*V3 = V2 + VI = Merge (VM,V0; SO) 

which is essentially adding V2 to the merged elements of V0, and multiply- 
ing the results by V4. The output of one functional unit may be chained 
into the input of a different functional unit. The actual time of chaining 
can occur anytime the output is available. The output can be chained into 
stores, masks, reduction operations, etc. Unless otherwise specified, 
there are no exceptions to this chaining criterion. In this last example. 
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chaining exists across three functional units . 

13.1.4 Fu nction al Ilnii Reservation 

As explained above, several instructions can be executed at the same time 
because more than one arithmetic or functional unit is provided. Multiple 
functional units (e.g., ADD, MULTIPLY, and DIVIDE) --rather than multiple 
units of the same type (i.e., two or more adders) --are provided to ensure 
this higher performance. However, when a vector instruction is decoded 
which requires a functional unit currently being used, a functional unit 
reservation occurs. This implementation means that the second instruction 
CANNOT execute simultaneously with the first. 

The types of independent functional units that are present are implementa- 
tion dependent. Generally, the possible arithmetic units are: 

1 ADD/SUB 

2 MULTIPLY/DIVIDE 

3 LOGICAL 

4 MASK/MERGE/COMPRESS 

5 LOAD FROM MEMORY 

6 STORE TO MEMORY 

These arithmetic units are structured into three separate and distinct 
functional units, grouped as follows: 

1 ADD/SUB, LOGICAL, COMPARE, POPULATION COUNT, SHIFTING 

2 MULTIPLY/DIVIDE 

3 LOAD/STORE FROM MEMORY, MASK/MERGE/COMPRESS 



The following is an example of functional unit reservation: 

V2 = VI + VO 
V5 = V4 + V3 

Both of these operations on the indicated vector registers require the ADD 
functional unit, and thus cannot function simultaneously. 



13.1.5 Register Ilnii Resei 

To permit simultaneous instruction execution with multiple functional 
units, multiple registers must be provided, which means that access must be 
provided for all data to be manipulated. If this access is NOT provided, 
register reservation occurs, and instruction execution is sequential. In 
the following example, 

VI = VI + VO 
V4 = V3 * VI 
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the two instructions are executed sequentially since there are three refer- 
ences to VI across the two instructions. This sequential execution occurs 
even though different functional units are specified. If the two instruc- 
tions were, 

V2 = VI + VO 
V5 = V4 * V3 

then both instructions would execute simultaneously. 



13.1.6 



Topology 



The CONVEX architecture has 8 vector accumulators. Within the processor of 
CONVEX-1, these registers are structured in the following manner across 4 
memory elements. 



| V4 
I VO 



V5 
VI 



I I 
I I 



V6 
V2 



V7 
V3 



Each of the above rectangles represents a high speed memory array contained 
within the processor. There are 4 such arrays; each has 2 vector accumula- 
tors with 64 bits in each element. The following are the access rules 
governing manipulations of elements in the array. 



NOTE: The noted vector accumulator topology is implementation specific 
CONVEX-1 and is NOT part of the architecture. 



to 



1) Two independent accesses can occur during each cycle for each array. 

2) These accesses can be any combination of read and write. Thus 
(read, read), (read, write) , (write, read) , and (write, write) are permitted. 

3) If more than two accesses are specified to the same array, the operation 
will still continue to function, but at reduced performance. 

4) Read and writes to the same array during the same operation are permit- 
ted. There are no unusual side effects; the array functions exactly as one 
would expect. For example, the operation VO = VO + VI adds vector accumula- 
tor to vector accumulator 1 and stores the result in vector accumulator 
0. 
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The following are some examples of full and partial speed operations as 
they relate to^this array accessing. 

The operation V2 = VI + VO proceeds at full speed. The operation V2 = V2 + 
VO proceeds at full speed. The operation V2 = V2 + V6 proceeds at half 
speed, since both V2 and V6 are in the same array. 



13.1.7 Recursion/Reduction 

Reduction operators are explicitly provided and are not just a byproduct 
of vector accumulator specification. Explicit operators are provided to 
perform SUM, PROD, MAX, MIN, and various logical reductions (ANY, ALL, 
PARITY) . The specification of a vector accumulator as both a source and 
destination causes the accumulator to function in the expected way, and 
there are no unusual side effects. The output of an operation can be 
chained into a reduction operation. For example, a dot product is a multi- 
ply chained into a sum operation. 



13.1.8 Scalar Fractional Units 

From the compiler's viewpoint, there is an infinite number of scalar func- 
tional units. Thus, for the arithmetic expression, 

Z = A+B + C + D + E + F 

the preferred way of evaluating the expression is: 

Tempi = A+B 
Temp2 = C+D 
Temp3 = E+F 

Tempi = Tempi + Temp 2 
Tempi = Tempi + Temp3 

This expression is in preference to: 

Temp = A +B 

Temp = Temp + C 

Temp = Temp + D 

Temp = Temp + E 

Temp = Temp + F 

This technique is referred to as "tree height reduction." 
13.2 Loads and Stores (Gather and Scatter) 



These instructions include Load Vector Register, Store Vector Register, 
Load Vector Register /Vector Index, Store Vector Register /Vector Index, 
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Store Scalar Extended/Vector Index, and Store Scalar Extended. The Load 
and Store with index instructions allow the user to use a vector register 
to specify those indices of another vector which will be affected. These 
operations are commonly referred to as gather and scatter. None of the 
flags in the PSW are affected. This permits scatter stores and gather 
loads to be implemented. 

VL elements of a vector are loaded into the specified vector accumulator 
Vk. The first element is referenced by the effective address produced by 
evaluating the L,@>A fields. The address of the next element accessed is 
obtained by adding the signed value in VS. This signed value is the dis- 
tance in BYTEs. The address of every successive element is obtained by 
adding VS to the address of the previous element. 



13 . 3 Vector/Vector 

Included in this section are the following instructions: Add, Subtract, 
Multiply, Divide, and Negate Vector/Vector. The flags in the PSW are 
affected, as follows: 

SIV = Integer Overflow; Integer Only 
OV = Exponent Overflow; Floating Point Only 
UN = Exponent Underflow; Floating Point Only 

SDZ = Divide by Zero; Integer Only 
RO = Reserved Operand; Floating Point Only 

FDZ = Divide by Zero; Floating Point Only 



13.4 Yaciox/Scaiar 

The instructions in this section include Add, Subtract, Multiply, and 
Divide Vector/Scalar. They affect the following flags in the PSW: 

SIV = Integer Overflow; Integer Only 
OV = Exponent Overflow; Eloating Point Only 
UN = Exponent Underflow; Floating Point Only 

SDZ = Divide by Zero; Integer Only 
RO = Reserved Operand; Floating Point Only 

FDZ = Divide by Zero; Floating Point Only 



13.5 Ysciox/Yeciox Logical Qcexaiifliia 

These instructions include AND, OR, Exclusive OR, and Complement 
Vector/Vector; none of the flags in the PSW is affected by their operation. 
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13.6 Vector/Scalar Logical Operations 

Instructions included in this group are AND, OR, and Exclusive OR Vector/ 
Scalar. No flags in the PSW are affected by their operation. 

13.7 Shifts and Mcjiss 

Logical Shift Vector/Scalar, Move Scalar /Vector, and Move Vector Element/ 
Scalar are the instructions in this section, and none affects the flags in 
the PSW. 
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LOAD VECTOR REGISTER 



Id. (b|h|w|l|s|d) <effa>,Vk 



Purpos 



Format: 



To load a vector into a vector accumulator. 



| Opcode |@|L| A j | Vk | | Displacement 

15 8,7,6,5 3,2 (31|15) 

Operation : 

temp = Effective Address 

DO 10 a = 0, (VL-1) 

Vk(a) = c(temp) 

temp = temp + VS 
10 CONTINUE 

PSW: 



I 




Exceptions : 



Opcode : 



ld.b <effa>,Vk 
ld.h <effa>,Vk 
ld.w <effa>,Vk 
ld.l <effa>,Vk 
ld.s <effa>,Vk 
ld.d <effa>,Vk 



001110000 Load vector byte 

001110010 Load vector halfword 

001110100 Load vector word 

001110110 Load vector longword 

001110100 Load vector single float 

001110110 Load vector double float 



Description : 

VL elements of a vector are loaded into the specified vector accu- 
mulator Vk. The first element is referenced by the effective 
address produced by evaluating the L,@,A fields. The address of the 
next element accessed is obtained by adding the signed value in VS. 
This signed value is the distance in BYTEs. 

The address of every successive element is obtained by adding VS to 
the address of the previous element. 



Notes : 



1 The value contained in VS can either be positive or 
negative . 

2 64 bit integers or 64 bit floating point operands are 
loaded using the ld.l instruction. 
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32 bit integer or 32 bit singal precision floating point 
operands are loaded using the ld.w instruction. 
If the distance between successive elements is less than 
the precision of an element, unpredictable actions 
occur . 

VS is not changed during the execution of this instruc- 
tion. 

The .s and .w forms of this instruction are equivalent, 
as are the .d and .1 forms. The . s and .d forms are 
added for convenience. 
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STORE VECTOR REGISTER 



st. (b|h|w|l|s|d) Vk,<effa> 



Purpose : 



To store a vector from a vector accumulator. 



Format : 



| Opcode |@|L| Aj| Vk| | Displacement 

15 8,7,6,5 3,2 (31|15) 

Operation : 

temp = Effective Address 
DO 10 a = 0, (VL-1) 

c(temp) = Vk(a) 

temp = temp + VS 
10 CONTINUE 

PSW: 



Exceptions : 



Opcode : 



st.b Vk,<effa> 
st.h Vk,<effa> 
st.w Vk,<effa> 
st.l Vk,<effa> 
st.s Vk,<effa> 
st.d Vk,<effa> 



001111000 Store vector byte 

001111010 Store vector halfword 

001111100 Store vector word 

001111110 Store vector longword 

001111100 Store vector single float 

001111110 Store vector double float 



Description : 

VL elements of a vector are stored from the specified vector accu- 
mulator Vk. The first element is referenced by the effective 
address produced by evaluating the L,@,A fields. The address of the 
next element accessed is obtained by adding the signed value in VS. 
This signed value is the distance in BYTEs. 

The address of every successive element is obtained by adding VS to 
the address of the previous element. 



Notes : 



1 The value contained in VS 
negative . 



can either be positive or 



2 64-bit integers or 64-bit floating point 
stored using the st . 1 instruction . 



operands are 



3 32-bit integers or 32-bit single precision floating 
point operands are stored using the st.w instruction. 
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4 If the distance between successive elements is less than 
the precision of an element, unpredictable actions 

. occur . 

5 VS is not changed during the execution of this instruc- 
tion. 

6 The . s and .w forms of this instruction are equivalent, 
as are the . d and . 1 forms . The . s and . d forms are 
added for convenience. 
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LOAD VECTOR REGISTER/VECTOR INDEX 



ldvi. (b|h|w|l|s|d) Vj,Vk 



Purpose : 



To load a vector into a vector accumulator using 
indices (commonly referred to as "vector gather") . 



vector of 



Format: 



| Opcode I Vj | Vk | 
15 6,5 3,2 

Operation : 

DO 10 a =0, (VL-1) 

temp = A5 + Vj(a)<31..0> 
Vk (a) = c<temp> 

10 CONTINUE 

PSW: 



Exceptions : 




Opcode : 




ldvi.b Vj,Vk 


0111100000 


ldvi.h Vj,Vk 


0111100001 


ldvi . w Vj , Vk 


0111100010 


ldvi.l Vj,Vk 


0111100011 


1 dvi . s V j , Vk 


0111100010 


ldvi.d Vj,Vk 


0111100011 



Index Load vector byte 
Index Load vector half word . 
Index Load vector word 
Index Load vector longword 
Index Load vector single float 
Index Load vector double float 

Description : 

VL elements of a vector are loaded into the specified vector accu- 
mulator, Vk. The first element is referenced by the effective 
address produced by adding the contents of A5 and Vj (0) . The second 
element is referenced by the effective address produced by adding 
the contents of AS and the least significant 32 bits of Vj (1) , and 
so on. 



Notes : 



1 A5 is typically loaded with an Idea 
address instruction.) 



{load effective 



2 64 bit integers or 64 bit floating point operands are 
loaded using the ldvi.l instruction. 

3 If the distance between successive elements is less than 
the precision of an element, unpredictable actions 
occur. 



Section 13.7 



13-12 



Vector/Scalar Instruction Set 



4 A5 is not changed during the execution of this instruc- 
tion. 

5 The contents of Vj (i) are treated as a byte offset. The 
appropriate shift of Vj may have to be performed to 
account for the precision of the operand to be loaded. 

6 The .s and .w forms of this instruction are equivalent, 
as are the . d and . 1 forms . The . s and . d forms are 
added for convenience. 
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STORE VECTOR REGISTER/VECTOR INDEX 



stvi. (b|h|w|l|s|d) Vk,Vj 



Purpose s 



To store a vector from a vector accumulator using 
indices (commonly referred to as "vector scatter") . 



vector of 



Format: 



I Opcode | Vj | Vk | 
15 6,5 3,2 

Operation : 

DO 10 a = 0/ (VL-1) 

temp = Vj(a<31..0>) + A5 
c (temp) = Vk (a) 

10 CONTINUE 

PSW: 



Exceptions : 




Opcode : 




stvi.b Vk,Vj 


0111101000 


stvi.h Vk,Vj 


0111101001 


stvi.w Vk,Vj 


0111101010 


stvi.l Vk,Vj 


0111101011 


stvi.s Vk,Vj 


0111101010 


stvi.d Vk,Vj 


0111101011 



Index Store vector byte 
Index Store vector half word 
Index Store vector word 
Index Store vector longword 
Index Store vector single float 
Index Store vector double float 

Description: 

VL elements of a vector are stored from the specified vector accu- 
mulator, Vk. The first element is referenced by the effective 
address produced by adding the contents of A5 and the least signi- 
ficant 32 bits Vj (0) . The second element is referenced by the 
effective address produced by adding the contents of A5 and Vj (1) , 
and so on. 

Notes : 

1 A5 is typically loaded with an Idea (load effective 
address instruction.) 

2 64-bit integers or 64-bit floating point operands are 
stored using the stvi.l instruction. 

3 32-bit integers or 32-bit single precision floating 
point operands are stored using the stvi.w instruction. 

4 If the distance between successive elements is less than 
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the precision of an element, unpredictable actions 
occur . 

5 A5 is not changed during the execution of this instruc- 
tion. 

6 The contents of Vj (i) are treated as a byte offset. The 
appropriate shift of Vj may have to be performed to 
account for the precision of the operand to be stored. 

7 The .s and .w forms of this instruction are equivalent, 
as are the . d and . 1 forms . The . s and . d forms are 
added for convenience. 
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STORE SCALAR EXTENDED/VECTOR INDEX 



stvi. (b|h|w|l|s|d) Sk,Vj 



Purpose: 



To store a vector 
indices 



from a scalar accumulator using a vector of 



Format ; 



| Opcode | Vj | Sk | 
15 6,5 3,2 

Operation: 

DO 10 a = 0, (VL-1) 

temp = Vj(a<31..0>) + AS 

c (temp) = Sk 
10 CONTINUE 

PSW: 



Exceptions : 
















Opcode : 
















stvi.b 


Sk,Vj 


0111101100 


Scalar 


Index 


Store 


vector 


byte 


stvi.h 


Sk,Vj 


0111101101 


Scalar 


Index 


Store 


vector 


halfword 


stvi . w 


Sk,Vj 


0111101110 


Scalar 


Index 


Store 


vector 


word 


stvi . 1 


Sk,Vj 


0111101111 


Scalar 


Index 


Store 


vector 


longword 


stvi . s 


Sk,Vj 


0111101110 


Scalar 


Index 


Store 


vector 


single float 


stvi . d 


Sk,Vj 


0111101111 


Scalar 


Index 


Store 


vector 


double float 



Description: 

VL elements of a vector are stored from the specified scalar accu- 
mulator, Sk. The first vector element to be stored into is refer- 
enced by the effective address produced by adding the contents of 
A5 and the least significant 32 bits Vj (0) . The second element is 
referenced by the effective address produced by adding the contents 
of A5 and Vj (1) , and so on. 



Notes : 



1 A5 is typically loaded with an Idea 
address instruction.) 



(load effective 



2 64-bit integers or 64-bit floating point operands are 
stored using the stvi . 1 instruction . 

3 32-bit integers or 32-bit floating point operands are 
stored using the stvi.w instruction. 

4 If the distance between successive elements is less than 
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the precision of an element, unpredictable actions 
occur . 

5 A5 is not changed during the execution of this instruc- 
tion. 

6 The contents of Vj (i) are treated as a byte offset. The 
appropriate shift of Vj may have to be performed to 
account for the precision of the operand to be stored. 

7 The . s and .w forms of this instruction are equivalent, 
as are the . d and . 1 forms . The . s and . d forms are 
added for convenience . 
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STORE SCALAR EXTENDED 



ste. (b|h|w|l|s|d) Sk, <ef fa> 



Purpose: 



To store a scalar register repetitively into memory. 



Format : 



| Opcode |@|L|Aj |Sk | 

15 8,7 6,5,3,2 

Operation : 

temp = Effective Address 
DO 10 a = 0, (VL-1) 

c<temp> = Sk 
temp = temp + VS 
10 CONTINUE 

Exceptions : 

Opcode : 

ste.b Sk,<effa> 001001000 
ste.h Sk,<effa> 001001010 
ste.w Sk,<effa> 001001100 
ste.l Sk,<effa> 001001110 
ste.s Sk,<effa> 001001100 
ste.d Sk,<effa> 001001110 



| Displacement | 
(31 | 15) 



Store an extended scalar byte 
Store an extended scalar halfword 
Store an extended scalar word 
Store an extended scalar longword 
Store an extended scalar single float 
Store an extended scalar double float 



Description: 

The contents of the scalar register Sk are repetitively stored into 
memory until VL. 



Notes : 



1 This instruction is used to perform the loop construct 
A(I)=K. 

2 Sk is unchanged at the completion of this instruction. 

3 This instruction can be used to clear memory by initial- 
izing Sk with 0. 
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4 The . s and .w forms of this instruction are equivalent 
as are the . d and . 1 forms . The . s and . d forms are 
added for convenience . 
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ADD VECTOR/VECTOR add. (b |h |w 1 1 |s |d) Vi,Vj,Vk 



Purpose : 

To add two vectors. 



Format : 



| Opcode | Vi | Vj | Vk | 
15 9,8 6,5 3,2 



Operation: 



DO 10 a = 0, (VL-1) 

Vk(a) - Vi(a) + Vj(a) 
10 CONTINUE 



PSW: 



SIV = Integer Overflow! Integer Only 
OV = Exponent Overflow! Floating Point Only 
UN = Exponent Under flowz! Floating Point Only 
RO = Reserved Operand! Floating Point Only 

Exceptions : 

Integer Overflow 
Exponent Overflow 
Exponent Underflow 
Reserved Operand 

Opcode : 

add.b Vi,Vj,Vk 1100000 Add vector/vector integer byte 

add.h Vi,Vj,Vk 1100001 Add vector/vector integer halfword 

add.w Vi,Vj,Vk 1100010 Add vector/vector integer word 

add.l Vi,Vj,Vk 1100011 Add vector/vector integer longword 

add.s Vi,Vj,Vk 1011000 Add vector/vector single float 

add.d Vi,Vj,Vk 1011001 Add vector/vector double float 

Description : 

The contents of the vector register Vi are added to the contents of 
the vector register V j , and the vector result is loaded into the 
vector register Vk. The number of elements added is determined by 
the value of VL at the time execution begins. 

Notes : 

Hold issues: functional unit and register reservation. 
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SUBTRACT VECTOR/VECTOR sub . (b | h | w 1 1 | s | d) Vi , Vj , Vk 



Purpose: 

To subtract two vectors . 

Format: 



| Opcode | Vi | Vj | Vk | 

15 9,8 6,5 3,2 

Operation : 

DO 10 a = 0, (VL-1) 

Vk(a) = Vi(a) - Vj(a) 
10 CONTINUE 

PSW: 

SIV = Integer Overflow; Integer Only 
OV = Exponent Overflow; Floating Point Only 
UN = Exponent Underflow; Floating Point Only 
RO = Reserved Operand; Floating Point Only 

Exceptions : 

Integer Overflow 
Exponent Overflow 
Exponent Underflow 
Reserved Operand 

Opcode : 

sub.b Vi,Vj,Vk 1101000 Subtract vector/vector integer byte 

sub.h Vi,Vj,Vk 1101001 Subtract vector/vector integer halfword 

sub.w Vi,Vj,Vk 1101010 Subtract vector/vector integer word 

sub.l Vi,Vj,Vk 1101011 Subtract vector/vector integer longword 

sub.s Vi,Vj,Vk 1011010 Subtract vector /vector single float 

sub.d Vi,Vj,Vk 1011011 Subtract vector /vector double float 

Description : 

The contents of the vector register Vj are subtracted from the con- 
tents of the vector register Vi, and the vector result is loaded 
into the vector register Vk. The number of elements subtracted is 
determined by the value of VL at the time execution begins. 

Notes : 
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MULTIPLY VECTOR/VECTOR 



mul. (b|h|w|l|s|d) Vi,Vj,Vk 



Purpose : 



To multiply two vectors. 



Format: 



| Opcode I VI | Vj | Vk | 

15 9,8 6,5 3,2 

Operation: 

DO 10 a = 0, (VL-1) 

Vk(a) = Vi(a) * Vj (a) 
10 CONTINUE 

PSW: 

SIV = Integer Overflow; Integer only 
OV = Exponent Overflow; Floating Point Only 
UN * Exponent Underflow; Floating Point Only 
RO = Reserved Operand; Floating Point Only 

Exceptions : 

Integer Overflow 
Exponent Overflow 
Exponent Underflow 
Reserved Operand 



Opcode : 



mul.b Vi,Vj,Vk 
mul.h Vi,Vj,Vk 
mul.w Vi,Vj,Vk 
mul.l Vi,Vj,Vk 
mul.s Vi,Vj,Vk 
mul.d Vi,Vj,Vk 



1110000 Multiply vector/vector 

1110001 Multiply vector/vector 

1110010 Multiply vector/vector 

1110011 Multiply vector/vector 

1001000 Multiply vector/vector 

1001001 Multiply vector/vector 



integer byte 
integer half word 
integer word 
integer longword 
single float 
double float 



Description : 

The contents of the vector register Vi are multiplied by the con- 
tents of the vector register V j , and the vector result is loaded 
into the vector register Vk. The number of elements multiplied is 
determined by the value of VL at the time execution begins. 

Notes : 
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DIVIDE VECTOR/VECTOR 



div. (b|h|w|l|s|d) Vi,Vj,Vk 



Purpose: 



To divide two vectors . 



Format: 



| Opcode | Vi | Vj | Vk | 

15 9,8 6,5 3,2 O 

Operation: 

DO 10 a = 0, (VL-1) 

Vk(a) = Vi(a) / Vj (a) 
10 CONTINUE 

PSW: 

SIV = Integer Overflow 
OV = Exponent Overflow 
UN = Exponent Underflow 
SDZ = Divide by Zero 
RO = Reserved Operand 
FDZ = Divide by Zero 

Exceptions : 

Integer Overflow 
Exponent Overflow 
Exponent Underflow 
Divide by Zero 
Reserved Operand 



Integer Only 
Floating Point Only 
Floating Point Only 
Integer Only 
Floating Point Only 
Floating Point Only 



Opcode : 



div.b Vi,Vj,Vk 1111000 Divide vector/vector 

div.h Vi,Vj,Vk 1111001 Divide vector/vector 

div.w Vi,Vj,Vk 1111010 Divide vector/vector 

div.l Vi,Vj,Vk 1111011 Divide vector/vector 

div.s Vi,Vj,Vk 1001010 Divide vector/vector 

div.d Vi,Vj,Vk 1O01O11 Divide vector/vector 



integer byte 
integer halfword 
integer word 
integer longword 
single float 
double float 



Description : 

The contents of the vector register Vi are divided by the contents 
of the vector register V j , and the vector result is loaded into the 
vector register Vk. The number of elements divided is determined by 
the value of VL at the time execution begins. 
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Notes: 

Hold issues: functional unit and register reservation. 
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NEGATE VECTOR/VECTOR 



neg. (b|h|w|l |s|d) Vj / Vk 



Purpose : 

To negate a vector. 

Format: 

| Opcode | Vj | Vk | 

15 6,5 3,2 

Operation : 

DO 10 a = 0, (VL-1) 

Vk(a) = - Vj(a) 
10 CONTINUE 

PSW: 

SIV = Integer Overflow; Integer Only 
OV = Exponent Overflow; Floating Point Only 
UN = Exponent Underflow; Floating Point Only 
RO = Reserved Operand; Floating Point Only 

Exceptions : 

Integer Overflow 
Exponent Overflow 
Exponent Underflow 
Reserved Operand 



Opcode : 



neg.b Vj,Vk 
neg.h Vj,Vk 
neg.w Vj,Vk 
neg.l Vj,Vk 
neg. s Vj,Vk 
neg.d Vj,Vk 



0110111000 Negate vector/vector integer byte 

0110111001 Negate vector/vector integer halfword 

0110111010 Negate vector/vector integer word 

0110111011 Negate vector/vector integer longword 

0110010010 Negate vector/vector single float 

0110010011 Negate vector/vector double float 



Description: 

The algebraic negation of vector register Vj is loaded into Vk. 
The number of elements negated is determined by the value of VL at 
the time execution begins . 

Notes : 
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ADD VECTOR/SCALAR add. (b|h|w| 1 |s|d) Vi,Sj,Vk 



Purpose: 

To add a scalar to a vector. 

Format : 

| Opcode | Vi | Sj | Vk | 

IS 9,8 6,5 3,2 



Operation: 



DO 10 a = 0, (VL-1) 
Vk(a) - Vi(a)+Sj 
10 CONTINUE 



PSW: 



SIV = Integer Overflow; Integer Only 
OV = Exponent Overflow; Floating Point Only 
UN = Exponent Underflow; Floating Point Only 
RO = Reserved Operand; floating point only 

Exceptions : 

Integer Overflow 
Exponent Overflow 
Exponent Underflow 
Reserved Operand 

Opcode : 

add.b Vi,Sj,Vk 1100100 Add vector/scalar integer byte 

add.h Vi Sj,Vk 1100101 Add vector/scalar integer halfword 

add.w Vi,Sj,Vk 1100110 Add vector/scalar integer word 

add.l Vi,Sj,Vk 1100111 Add vector/scalar integer longword 

add.s Vi,Sj,Vk 1011100 Add vector/scalar single float 

add.d Vi,Sj,Vk 1011101 Add vector/scalar double float 

Description: 

The contents of the scalar register Sj are added to the contents of 
the vector register Vi, and the vector result is loaded into the 
vector register Vk. The number of elements added is determined by 
the value of VL at the time execution begins. 

Notes : 
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SUBTRACT VECTOR/SCALAR sub. (b |h | w | 1 |s |d) Vi,Sj,Vk 



Purpose: 

To subtract a scalar from a vector. 

Format: 



| Opcode | Vi | Sj | Vk | 

15 9,8 6,5 3,2 O 

Operation : 

DO 10 a = 0, (VL-1) 

Vk(a) = Vi(a) - Sj 
10 CONTINUE 



PSW: 

SIV = Integer Overflow; Integer Only 
OV = Exponent Overflow; Floating Point Only- 
UN = Exponent Underflow; Floating Point Only 
RO — Reserved Operand; Floating Point Only 

Exceptions : 

Integer Overflow 
Exponent Overflow 
Exponent Underflow 
Reserved Operand 

Opcode : 

sub.b Vi,Sj,Vk 1101100 Subtract vector/scalar integer byte 

sub.h Vi,Sj,Vk 1101101 Subtract vector/scalar integer halfword 

sub.w Vi,Sj,Vk 1101110 Subtract vector/scalar integer word 

sub.l Vi,Sj,Vk 1101111 Subtract vector /scalar integer longword 

sub.s Vi,Sj,Vk 1011110 Subtract vector/scalar single float 

sub.d Vi,Sj,Vk 1011111 Subtract vector/scalar double float 

Description : 

The contents of the scalar register Sj are subtracted from the con- 
tents of the vector register Vi, and the vector result is loaded 
into the vector register Vk. The number of elements subtracted is 
determined by the value of VL at the time execution begins. 

Notes: 
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MULTIPLY VECTOR/SCALAR mil . (b | h | w | 1 | s | d) Vi , S j . Vk 



Purpose : 

To multiply a scalar with a vector. 

Format : 

| Opcode | Vi | Sj | Vk | 

15 9,8 6,5 3,2 

Operation: 

DO 10 a = 0, (VL-1) 

Vk(a) = Vi(a) * Sj 
10 CONTINUE 

PSW: 

SIV = Integer Overflow; Integer Only 
OV = Exponent Overflow; Floating Point Only 
UN = Exponent Underflow; Floating Point Only 
RO = Reserved Operand; Floating Point Only 

Exceptions : 

Integer Overflow 
Exponent Overflow 
Exponent Underflow 
Reserved Operand 

Opcode : 

mul.b Vi,Sj,Vk 1110100 Multiply vector/scalar integer byte 

mul.h Vi,Sj,Vk 1110101 Multiply vector/scalar integer halfword 

mul.w Vi,Sj,Vk 1110110 Multiply vector/scalar integer word 

mul.l Vi,Sj,Vk 1110111 Multiply vector/scalar integer longword 

mul.s Vi,Sj,Vk 1001100 Multiply vector/scalar single float 

mul.d Vi,Sj,Vk 1001101 Multiply vector/scalar double float 

Description: 

The contents of the scalar register Sj are multiplied with the con- 
tents of the vector register Vi, and the vector result is loaded 
into the vector register Vk. The number of elements multiplied is 
determined by the value of VL at the time execution begins. 

Notes: 
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DIVIDE VECTOR/SCALAR 



div. (b|h|w|l|s|d) Vi,Sj,Vk 



Purpose : 



To divide a vector by a scalar. 



F ormat : 



| Opcode | Vi | Sj | Vk | 

15 9,8 6,5 3,2 

Operation: 

DO 10 a = 0, (VL-1) 

Vk(a) = Vi(a) / Sj 
10 CONTINUE 



PSW: 

SIV = Integer Overflow 
OV = Exponent Overflow 
UN = Exponent Underflow 
SDZ = Divide by Zero 
RO = Reserved Operand 
FDZ = Divide by Zero 

Exceptions : 

Integer Overflow 
Exponent Overflow 
Exponent Underflow 
Divide by Zero 
Reserved operand 



! Integer Only 

! Floating Point Only 

! Floating Point Only 

! Integer Only 

! Floating Point Only 

! Floating Point Only 



Opcode : 



div.b Vi,Sj,Vk 
div.h Vi,Sj,Vk 
dlv.w Vi,Sj,Vk 
div.l Vi,Sj,Vk 
div.s Vi,Sj,Vk 
div.d Vi,Sj,Vk 



1111100 Divide vector/scalar integer byte 

1111101 Divide vector/scalar integer half word 

1111110 Divide vector/scalar integer word 

1111111 Divide vector/scalar integer longword 

1001110 Divide vector/scalar single float 

1001111 Divide vector/scalar double float 



Description : 

The contents of the vector Vi are divided by the scalar S j , and the 
vector result is loaded into the vector register Vk. The number of 
elements divided is determined by the value of VL at the time exe- 
cution begins. 
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Notes : 

Hold issues: functional unit and register reservation. 
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AND VECTOR/VECTOR and Vi,Vj,Vk 



Purpose : 

To AND the contents of two vectors. 

Format: 

| Opcode | Vi | Vj | Vk | 

15 9,8 6,5 3,2 

Operation : 

DO 10 a = 0, (vl-1) 

Vk(a) = Vi(a) .AND. Vj (a) 
10 CONTINUE 

PSW: 

Exceptions : 

Opcode : 

and Vi,Vj,Vk 1010000 AND two vectors 

Description : 

The elements of the Vj vector register are ANDed with the elements 
of the Vi register. The results of the AND are loaded into Vk. The 
number of ANDs is equal to VL. All 64-bits of each element of a 
vector register participate in the operation . 

Notes : 

The contents of one vector register, Vi, can be moved to another 
vector register, Vk, by specifying Vi and Vj as the same register. 
Thus, for example, AND VO, VO, VI moves the contents of VO to VI. 
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OR VECTOR/VECTOR 



or Vi,Vj,Vk 



Purpose: 

To OR the contents of two vectors. 

Format: 

| Opcode I Vi | Vj | Vk | 
15 9,8 6,5 3,2 

Operation : 

DO 10 a = 0, (VL-1) 

Vk(a) = Vi(a) .OR. Vj (a) 
10 CONTINUE 

PSW: 

Exceptions : 

Opcode: 

or Vi,Vj,Vk 1010001 OR two vectors 

4 

Description : 

The elements of the Vj vector register are ORed with the elements 
of the Vi register. The results of the OR are loaded into Vk. The 
number of ORs is equal to VL. All 64 bits of each element of a vec- 
tor register participate in the operation. 

Notes : 
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EXCLUSIVE OR VECTOR/VECTOR xor Vi,Vj,Vk 



Purpose: 

To Exclusive OR the contents of two vectors. 

Format: 

| Opcode | Vi | Vj | Vk | 

15 9,8 6,5 3,2 
Operation : 



DO 10 a = 0, (VL-1) 

Vk(a) = Vi(a) .XOR. Vj (a) 
10 CONTINUE 

PSW: 

Exceptions : 

Opcode : 

xor Vi,Vj,Vk 1010010 Exclusive OR two vectors 

Description : 

The elements of the Vj vector register are exclusive ORed with the 
elements of the Vi register. The results of the exclusive OR are 
loaded into Vk. The number of exclusive ORs is equal to VL. All 64 
bits of each element of a vector register participate in the opera- 
tion. 

Notes: 
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COMPLEMENT VECTOR/VECTOR not V j / Vk 



Purpose : 

To COMPLEMENT the contents of a vector. 

Format : 

| Opcode | Vj | Vk | 

15 6,5 3,2 

Operation: 

DO 10 a = 0, (VL-1) 

Vk(a) = .NOT. Vj (a) 
10 CONTINUE 

PSW: 

Exceptions : 

Opcode : 

not Vj,Vk 0110001011 Complement a vector 

Description : 

The elements of the Vj vector register are complemented and the 
results loaded into the Vk vector register. The number of opera- 
tions is equal to VL. All 64-bits of each element of a vector 
register participate in the operation. 

Notes s 
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AND VECTOR/SCALAR and Vi,Sj,Vk 



Purpose: 

To AND the contents of a vector and a scalar. 

Format : 

| Opcode | Vi | Sj | Vk | 

15 9,8 6,5 3,2 O 
Operation : 



DO 10 a = 0, (Vl-1) 

Vk(a) = Vi(a) .AND. Sj 
10 CONTINUE 

PSW: 

Exceptions : 

Opcode : 

and Vi,Sj,Vk 1010100 AND vector/scalar 

Description : 

The elements of the Vi vector register are ANDed with the contents 
of the Sj register. The results of the AND are loaded into Vk. The 
number of ANDs is equal to VL. All 64-bits of each element of a 
vector register participate in the operation. 

Notes : 
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OR VECTOR/SCALAR or V i,Sj,Vk 



Purpose : 

To OR the contents of a vector and a scalar. 

Format: 

| Opcode | Vi | Sj | Vk | 

15 9,8 6,5 3,2 

Operation: 

DO 10 a = 0, (VL-1) 

Vk(a) = Vi(a) .OR. Sj 
10 CONTINUE 

PSW: 

Exceptions : 

Opcode : 

or Vi,Sj,Vk 1010101 OR vector/scalar 

Description : 

The elements of the Vj vector register are ORed with the contents 
of the Sj register. The results of the OR are loaded into Vk. The 
number of ORs is equal to VL. All 64 bits of each element of a vec- 
tor register participate in the operation. 

Notes: 
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EXCLUSIVE OR VECTOR/SCALAR xor Vi,Sj,Vk 



Purpose: 

To Exclusive OR the contents of a vector and a scalar 

Format: 

| Opcode | Vi | Sj | Vk | 

15 9,8 6,5 3,2 

Operation : 

DO 10 a = 0, (VL-1) 

Vk(a) = Vi(a) .XOR. Sj 
10 CONTINUE 

PSW: 

Exceptions : 

Opcode : 

xor Vi,Sj,Vk 1010110 Exclusive OR vector/scalar 

Description: 

The elements of the Vi vector register are exclusive ORed with the 
contents of the Sj register. The results of the exclusive OR are 
loaded into Vk. The number of exclusive ORs is equal to VL. All 
64 bits of each element of a vector register participate in the 
operation. 

Notes: 
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LOGICAL SHIFT VECTOR/SCALAR shf sj,Vk 



Purpose : 

To logically shift the contents of a vector register by a scalar 
register . 

Format: 

| Opcode | Sj | Vk | 

15 6,5 3,2 

Operation : 

DO 10 a = O, (VL-1) 

Vk(a) = Shift Vk(a) by Sj<7. ,0> 
10 CONTINUE 

PSW: 

Exceptions: 

Opcode : 

shf Sj,Vk 0110001100 Shift a vector accumulator 

Description : 

The contents of Vk (i) are shifted according to the contents of S j . 
When Sj is positive Vk(i) is shifted left. When Sj is negative 
Vk(i) is shifted right. Each vector element of Vk until VL is 
shifted. All 64 bits of Vk(i) participate in the shift. Vacated 
positions are zero filled. Only Sj<7..0> are used to control the 
shift. Sj<63..8> are ignored. 



Notes: 



Arithmetic shifts are implemented using multiplies and divides. 
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MOVE SCALAR/VECTOR mov Si,Sj,Vk 



Purpose: 

To move a scalar register to a vector register element. 

Format: 

| Opcode | Si | Sj | Vk | 

15 9,8 6,5 3,2 

Operation: 

Vk(Sj<6. .0>) = Si 

PSW: 

Exceptions : 

Opcode : 

mov Si,Sj,Vk 1000001 Move a scalar to a vector element 

Description: 

The contents of the scalar Register Si are loaded into an element 
of the vector register Vk. The particular Vk(i) is determined from 
bits<6. .0> of S j . All other bits of Sj are ignored. 

Notes : 
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MOVE VECTOR ELEMENT/SCALAR mov Vi,Sj,Sk 



Purpose : 

To move a vector element into a scalar register. 

Format: 

| Opcode | Vi | Sj | Sk | 

IS 9,8 6,5 3,2 

Operation : 

Sk = Vi(Sj<6. .0>) 

PSW: 

Exceptions : 

Opcode : 

mov Vi,Sj,Sk 1000000 Move a vector element to a scalar 

Description t 

The vector element in the Vi register referenced by bits<6. .0> of 
Sj is loaded in Sk. 

Notes : 
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CHAPTER 14 
14 Comparisana/Maak/Merge/Gamprssa Inatruciicns 



The instructions in this chapter perform comparisons between vectors and 
scalars, and perform manipulations using the vector merge (VM) register. 
The general methodology is for a comparison to produce a bit vector, where 
a 1 indicates that the comparison is .TRUE., and a indicates that the 
comparison is .FALSE. This method is similar to the way comparisons with 
address and scalar registers function. The results of a vector compare 
(and consequently the contents of the VM register) are used quite dif- 
ferently from manipulations on the A registers, however. 

On most machines, many common operations performed on vectors, such as 
compress, mask, and merge, require the use of branch instructions. In CON- 
VEX machines, these operations are included as primitives in the instruc- 
tion set. To eliminate the use of branch instructions when manipulating 
vectors, a full set of primitives is provided that implements compress, 
mask, and merge operations on vector accumulators. These primitives permit 
vector operations to be implemented with a sequence of chained vector 
instructions- '•as with vector clip, for example, where every element greater 
than 5 is replaced with 5. Other typical operations using these instruction 
set primitives include: operations on sparse vectors using a compress 
operation, the number and location of zero crossings; the number of suc- 
cessful comparisons; sorts, and others. 

Three forms of compare operations are provided: .LE., .LT., and .EQ. For 
the other 3 operations, the following identities hold: .NOT. (V .REL. S) 
<=> V (.NOT. .REL.) S, where the following relations are complements: 

.NE. <--> .EQ. 
.LE. <--> .GT. 
.LT. <--> .GE. 



14.1 Vector 

The two instructions defined in this section are Compare Vector/Vector, 
(used between two vectors) and Compare Vector /Scalar (used between one vec- 
tor and one scalar) . Note: the instruction Compare Scalar /Scalar, covered 
in Chapter 10--"Scalar Register Instruction Set"--is used between two 
scalars. The three Compare instructions are It, le, and eq: 

(It, le, eq)x(b,h,w,l,s,d) 



There are six different data types: integer 8, 16, 32, and 64, and single 
and double precision floating point (32 and 64 bit) . The only flag 
affected in the PSW is the RO bit, the Reserved Operand used for Floating 
Point only. 
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14 . 2 Ma^/Meras/Cflmprsss 

The instructions defined in this section are Compress, Merge Vector/Vector, 
Merge Vector/Scalar, Mask Vector/Vector, and Mask Vector/Scalar: 

CPRS MERG mask 

The Compress instruction uses the VM register to extract elements selec- 
tively from one vector register and place them in another. Either O's or 
l's of VM may be used by specifying the .f or .t (false or true) version of 
the instruction, respectively. Both Mask and Merge instructions take two 
input operands and produce a third operand as the result. These operands 
are referred to as Vi, R j , and Vk, where Vk is the output. Rj may be 
either a vector or a scalar register. The Merge and Mask instructions 
differ only in the way in which the indices are used to create the result 
vector. For the Mask instruction, element n of Vk is either element n of 
Vi or element n of Rj . In the case of the Merge instruction, the indices 
of Vi and Rj are only incremented if that particular register is selected 
by VM. 

These instructions are best described by a few examples. First, a simple 
rule is presented. Each instruction either has a single, "true" version, 
or both a true (.t) and a false (.f) version. -This facility allows the 
user to utilize either the l's of VM (.t) or the 0*s ( . f ) . Thus, in the .t 
case, when the appropriate bit of VM is a 1, the Rj operand is selected. 
The various combinations of VM, .t, and .f are shown in the following 
diagram. 

VM 
1 



• t | Vi Rj 

I 
•f I Rj Vi 



14.2.1 Mask /Merge/ Comp ress Examp les 

Examples of compress, mask, and merge are. now presented. The following 
values are assumed before instruction execution: 
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VO = 012345 

Vl=abcdef 

VM = 011001 

VL = 6 

SI = 8 
Performing a compress on VO produces the following: 

12 5= cprs.t V0,V5 

3 4= cprs.f V0,V5 
Performing a mask of VO and VI produces the following: 

b c 3 4 f = mask.t V0,V1,V5 

and: 

a 1 2 d e 5 = mask.t V1,V0,V5 

and: 

8 8 3 4 8= mask.t V0,S1,V5 
Performing a merge of VO and VI produces the following: 

0abl2c456def= merg.t V0,V1,V5 

where VL = 12, and 

VM=011001000111 
Performing a merge of VO and SI produces the following: 

088128345888= merg.t V0,S1,V5 

or: 

801882888345= merg.f V0,S1,V5 

where VL = 12, and 

VM=011001000111 
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COMPARE VECTOR/VECTOR 



(le|lt|eq) . (b|h|w|l|s|d|) Vj,Vk 



Purpose : 



To compare two vectors and load VM 



Format : 



Opcode 



Vj 



| Vk 



15 



6,5 



3,2 



Operation : 

DO 10 



10 



a = 0, (VL-1) 

IF(Vj .OPCODE -TEST. Vk (a) ) THEN 

VM(a) = 1 
ELSE 

VM(a) =0 
CONTINUE 



PSW: 



RO = Reserved Operand ; Floating Point Only 



Exceptions : 

Reserved Operand (floating point only) 



Opcode 



le.b Vj,Vk 
lt.b Vj,Vk 
eq.b Vj,Vk 

le.h Vj,Vk 
lt.h Vj,Vk 
eq.h Vj,Vk 

le.w Vj,Vk 
lt.w Vj,Vk 
eq . w V j , Vk 

le.l Vj,Vk 
lt.l Vj,Vk 
eq.l Vj,Vk 

le.s Vj,Vk 
lt.s Vj,Vk 
eq.s Vj,Vk 

le.d Vj,Vk 
lt.d Vj,Vk 
eq.d Vj,Vk 



0110101000 Compare less than or equal byte 

0110110000 Compare less than byte 

0110100000 Compare equal byte 

0110101001 Compare less than or equal half word 

0110110001 Compare less than halfword 

0110100001 Compare equal halfword 

0110101010 Compare less than or equal word 

0110110010 Compare less than word 

0110100010 Compare equal word 

0110101011 Compare less than or equal longword 

0110110011 Compare less than longword 

0110100011 Compare equal longword 

0110011000 Compare less than or equal single 

0110011010 Compare less than single 

0110010000 Compare equal single 

0110011001 Compare less than or equal double float 

0110011011 Compare less than double float 

0110010001 Compare equal double precision 
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Description : 

The elements of the Vj vector register are signed compared with the 
elements of the Vk register. The results of the compare are loaded 
into VM. The number of compares is equal to VL. VM(n) is loaded 
with the result of the compare between Vk (n) and Vj (n) . When VL is 
less than 128, the remaining bit positions of VM are reset to 0. 

Notes: 

1 There are no unsigned vector compares. 

2 The pic instruction can be used to determine the number 
of successful compares. 

3 By removing VM to a scalar register and performing a 
leading zero count, the index of the first successful 
compare can be performed. 
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COMPARE VECTOR/SCALAR 



(le|lt|eq). (b|h|w|l|s|d|) Sj,Vk 



Purpose : 



To compare a vector and a scalar and load VM 



Format: 



15 

Operation : 

DO 10 



Opcode | Sj | Vk | 
6,5 3,2 



a = 0, (VL-1) 

IF (Sj .OPCODE-TEST. Vk (a) ) THEN 

VM(a) =1 
ELSE 

VM(a) =0 
10 CONTINUE 



PSW: 



RO = Reserved Operand ; Floating Point Only 



Exceptions : 

Reserved Operand (floating point only) 



Opcode: 



le.b Sj,Vk 
lt.b Sj,Vk 
eq.b Sj,Vk 

le.h Sj,Vk 
lt.h Sj,Vk 
eq.h Sj,Vk 

le.w Sj,Vk 
lt.w Sj,Vk 
eq.w Sj,Vk 

le.l Sj,Vk 
lt.l Sj,Vk 
eq.l Sj,Vk 

le.s Sj,Vk 
lt.s Sj,Vk 
eq . s S j , Vk 

le.d Sj,Vk 
lt.d Sj,Vk 
eq.d Sj,Vk 



0110101100 
0110110100 
0110100100 

0110101101 
0110110101 
0110100101 

0110101110 
0110110110 
0110100110 

0110101111 
0110110111 
0110100111 

0110011100 
0110011110 
0110010100 

0110011101 
0110011111 
0110010101 



Compare less than or equal byte 
Compare less than byte 
Compare equal byte 

Compare less than or equal halfword 
Compare less than halfword 
Compare equal halfword 

Compare less than or equal word 
Compare less than word 
Compare equal word 

Compare less than or equal longword 
Compare less than longword 
Compare equal longword 

Compare less than or equal single 
Compare less than single 
Compare equal single 

Compare less than or equal double float 
Compare less than double float 
Compare equal double precision 
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Description : 

Sj is signed compared with the elements of the Vk register. The 
results of the compare are loaded into VM. The number of compares 
is equal to VL. VM{n) is loaded with the result of the compare 
between Vk (n) and S j . When VL is less than 128, the remaining bit 
positions of VM are reset to 0. 

Notes : 

There are no unsigned vector compares. 
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COMPRESS _„_„ ,+. . _.* ... ,„ 

cprs . (t | f ) Vj , Vk 



Purpose : 

To compress a vector using VM. 

Format : 

I Opcode | Vj | Vk | 

15 6,5 3,2 

Operation : 

Vk = VM Compress Vj 
a = 

DO 10 b = 0, (VL-1) Jcprs.f 

IF (VM(b) .EQ. O) THEN !0 for false 
Vk(a) = Vj(b) 
a = a + 1 

END IF 
10 CONTINUE 

DO 10 b = O, (VL-1) icprs.t 

IF (VM(b) .EQ. 1) THEN !1 for true 
Vk (a) = Vj (b) 

a = a + 1 
END IF 
10 CONTINUE 

PSW: 
Exceptions : 

Opcode : 

cprs.f Vj,Vk 0110001110 Compress a vector using not VM 
cprs.t Vj,Vk 0110001111 Compress a vector using VM 

Description : 

The vector Vj is compressed using VM. The result is loaded into Vk. 
The number of elements loaded into Vk is equal to the number of 
0/1' s in VM, up to VL. Vj and VM are unchanged upon completion of 
the instruction. All 64-bits of a vector element participate in the 
compress operation. 

Notes : 

1 The compress operation provides a useful means to 
operate on vector elements that satisfy a constraint 
condition. Since compress is a pipelined operation, the 
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use of scalar operations or compares and branches to 
achieve the same result is avoided. 

2 The compress operation can be useful for implementing a 
particular class of binary sorts. 

3 The pic VM instruction should be used to determine the 
number of elements loaded into Vk . 
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MERGE VECTOR/VECTOR merg.t Vi,Vj,Vk 



Purpose : 

To MERGE one vector into another. 

Format : 

| Opcode | Vi | Vj | Vk | 

15 9,8 6,5 3,2 

Operation: 

a = 
b = 

DO 10 e = 0, (VL-1) 

IF (VM(e) .eq. 1) THEN 
Vk(e) = Vj(b) 
b = b + 1 
ELSE 

Vk(e) = Vi(a) 
a = a + 1 
END IF 
10 CONTINUE 

PSW: 

Exceptions : 

Opcode : 

merg.t Vi,Vj,Vk 1000010 Merge vector/vector 

Description : 

The vectors Vi and Vj are merged into the vector Vk using VM. The 
number of merge operations is equal to VL. Vi, V j , and VM are 
unchanged after the merge operation is completed. The merge opera- 
tion moves sequential elements from either the Vi or Vj vectors to 
Vk according to values contain in VM. 
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Notes : 

1 The merge provides a convenient means by which to 
reassemble operands from two vectors into one vector. 
Typically, the operands were initially scrambled using a 
compress operation . 

2 Merge using .NOT. VM is equivalent to MERGE with Vi and 
Vj interchanged. 
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MERGE VECTOR/SCALAR merg.(t|f) Vi,Sj,Vk 



Purpose : 

To MERGE a scalar into a vector. 

Format: 

I Opcode I Vi | Sj | Vk | 

15 9,8 6,5 3,2 

Operation: 

b = 

IF (MERG.T) THEN 

BIT = 1 
ELSE 

BIT = 
ENDIF 
DO 10 a = 0, (VL-1) 

IF (VM(a) .EQ. BIT) THEN 

Vk(a) = Sj 
ELSE 

Vk(a) = Vi(b) 
b = b + 1 
END IF 
10 CONTINUE 

PSW: 

Exceptions : 

Opcode : 

merg.t Vi,Sj,Vk 1000110 Merge vector/scalar 

merg.f Vi,Sj,Vk 1000100 Merge vector/scalar using not VM 

Description : 

The scalar Sj and the vector Vi are merged into the vector Vk using 
VM. The number of merge operations is equal to VL. S j , Vi, and VM 
are unchanged after the merge operation is completed. 

Notes : 

The merge provides a convenient means by which to reassemble 
operands into one vector. Typically, the operands were initially 
scrambled using a compress operation. 
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MASK VECTOR/VECTOR mask.t Vi,Vj,Vk 



Purpose: 

To MASK one vector into another . 

Format: 

| Opcode | Vi | Vj | Vk | 

15 9,8 6,5 3,2 O 

Operation: 

Vk = VM MASK (Vi/ Vj) 

DO 10 a = 0, (VL-1) 

IF (VM(a) .EQ. 1) THEN 

Vk(a)= Vj(a) 
ELSE 

Vk(a)= Vi(a) 
END IF 
10 CONTINUE 

PSW: 

Exceptions : 

Opcode : 

mask.t Vi,Vj,Vk 1000011 Mask vector/vector 

Description: 

The vectors Vi and Vj are masked into the vector Vk using VM. The 
number of mask operations is equal to VL. Vi, Vj, and VM are 
unchanged after the mask operation is completed. 



Notes : 



Rearranging Vi and Vj is equivalent to using .NOT. VM. 
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MASK VECTOR/SCALAR mask . (t | f ) V i , S j , Vk 



Purpose: 

To MASK a scalar into a vector 

Format : 

| Opcode | Vi | Sj | Vk | 

15 9,8 6,5 3,2 

Operation : 

Vk = VM MASK (Vi, Sj) 

IF (MASK.T) THEN 

BIT = 1 
ELSE 

BIT =0 
ENDIF 

DO 10 a = 0, (VL-1) 

IF (VM(a) .EQ. BIT) THEN 

Vk(a)= Sj 
ELSE 

Vk(a)= Vi(a) 
END IF 
10 CONTINUE 

PSW: 

Exceptions : 

Opcode : 

mask.t Vi,Sj,Vk 1000111 Mask vector/scalar using VM 
mask.f Vi,Sj,Vk 1000101 Mask vector/scalar using not VM 

Description: 

The scalar Sj and the vector Vi are masked into the vector Vk using 
VM. The number of mask operations is equal to VL. S j , Vi, and VM 
are unchanged after the mask operation is completed. 
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Notes : 

1 The mask scalar operation provides a convenient means to 
perform the vector clip operation. Typically, the clip 
threshold is used to compute a bit vector loaded into 
VM. The mask scalar operation is then used to load the 
threshold value into the vector elements whose values 
exceeded this threshold. 

2 IF VM is all l's, the scalar Sj is extended and loaded 
into all the elements of Vk, for mask.t. This is a con- 
venient way of loading a scalar into all elements of a 
vector accumulator. 
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CHAPTER 15 
15 Vectsx Reduction Instruction Set 



Reduction operations reduce a vector to a scalar. There are two inputs to a 
reduction operator: one input is a scalar register, the other input a vec- 
tor register. A scalar input is provided so that reduction operations can 
be performed for vectors greater than 128 elements. Mathematically, reduc- 
tion operations are the SUM (sum reduction) and PROD (multiply or product 
reduction) . Additional reduction operations are provided to implement the 
FORTRAN MAX and MIN intrinsics, as well as reduction using logical opera- 
tors, such as AND (ALL) , OR (ANY) , and XOR (PARITY). These latter reduction 
operations are used for a certain class of pattern recognition algorithms. 

For these operations, the scalar register Sk must be initialized to the 
identity operand of the particular operation. Thus, for add the identity 
operand is 0, for multiply 1, for and all l's, for OR all 0's, for XOR all 
0's, for MAX the smallest number, and for MIN the maximum number. Other 
operations are provided to manipulate the VM register. These take the form 
o f reductions on VM to determine the number of l's or 0's. 



15.1 

The Instructions defined in this section are Sum, Product, Max, and Min 
Vector on the following data types: (b|h|w| 1 |s |d) . The flags in the PSW 
affected by these instructions are as follows: 

SIV = Integer Overflow 
0V = Exponent Overflow 
UN = Exponent Underflow 
R0 = Reserved Operand 



15.2 Logical Reductions 

This section supports the following instructions: OR Reduce Vector (ANY), 
AND Reduce Vector (ALL) , and EXCLUSIVE OR Reduce Vector (PARITY) . None of 
the flags in the PSW are affected by these instructions. 



15.3 Regulation Count Ysctar 

The Population Count Vector instruction affects no flags in the PSW. It 
returns the number of ones in each element of the input vector. 
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SUM VECTOR 



sum. (b|h|w|l|s|d) Vk 



Purpose : 

To SUM all the elements of a vector. 

Format : 

| Opcode | Vk/Sk | 

15 3,2 

Operation: 

DO 10 a = 0, (VL-1) 

Sk = Sk + Vk(a) ! See notes below. 
10 CONTINUE 



PSW: 

SIV = Integer Overflow 
UN as Exponent Underflow 
OV — Exponent Overflow 
RO = Reserved Operand 

Exceptions : 

Integer Overflow 
Exponent Overflow 
Exponent Underflow 
Reserved Operand 

Opcode : 



sum.b Vk 0111111000000 Sum a vector of bytes 

sum.h Vk 0111111000001 Sum a vector of half words 

sum.w Vk 0111111000010 Sum a vector of words 

sum.l Vk 0111111000011 Sum a vector of longwords 

sum.s Vk 0111111010000 Sum a vector of single float 

sum.d Vk 0111111010001 Sum a vector of double float 

Description: 

The sum of the scalar register Sk and all elements of Vk until VL 
is calculated. The number of elements added is determined by VL. 
and the result is loaded into Sk. 

Notes : 

1 The scalar register should be initialized to zero for 
the first use of the summation instruction. 

2 The sequence of the sum executed by the hardware is NOT 
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identical to the FORTRAN sequence as noted above. 
Please see the Hardware Reference Manual for the exact 
sequence of operations. 

3 Since this instruction reduces a Vk and Sk into Sk, the 
assembler will accept either Vk or Sk as the argument to 
this instruction. 
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PRODUCT VECTOR 



prod. (b|h|w|l |s|d) Vk 



Purpose : 



To obtain the products of all the elements of a vector. 



Format: 



Opcode 



15 
Operation: 



| Vk/Sk | 
3,2 



DO 10 a = 0, (VL-1) 
Sk = Sk * Vk(a) 
10 CONTINUE 



! See notes below. 



PSW: 

SIV — Integer Overflow 
OV = Exponent Overflow 
UN = Exponent Underflow 
RO = Reserved Operand 
1 

Exceptions : 

Integer Overflow 
Exponent Overflow 
Exponent Underflow 
Reserved Operand 



Opcode : 



prod.b Vk 
prod.h Vk 
prod.w Vk 
prod.l Vk 
prod.s Vk 
prod.d Vk 



0111111011000 
0111111011001 
0111111011010 
0111111011011 
0111111010010 
0111111010011 



Multiply 


reduce 


a 


vector 


of 


bytes 


Multiply 


reduce 


a 


vector 


of 


halfwords 


Multiply 


reduce 


a 


vector 


of 


words 


Multiply 


reduce 


a 


vector 


of 


longwords 


Multiply 


reduce 


a 


vector 


of 


single flo 


Multiply 


reduce 


a 


vector 


of 


double flo, 



Description : 

The product of the scalar register Sk and all elements of Vk is 
calculated. The number of elements multiplied is determined by VL, 
and the result is loaded into Sk. 



Notes : 



1 The scalar register should be initialized to one for the 
first use of the multiply reduce instruction. 

2 The sequence of the products performed by the hardware 
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is NOT identical to the FORTRAN sequence as noted above. 
Please refer to the Hardware Reference Manual for the 
exact sequence of operations. 

3 Since this instruction reduces a Vk and Sk into Sk, the 
assembler will accept either Vk or Sk as the argument to 
this instruction. 
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MAX VECTOR 



max. (b|h|w|l|s|d) Vk 



Purpose: 



To find the maximum element of a vector, 



Format : 



Opcode 



15 



I Vk/Sk | 
3,2 



Operation : 

DO 10 a = 0, (VL-1) 

IF (Vk(a).GT.Sk) THEN 

Sk = Vk (a) 
ENDIF 
10 CONTINUE 



PSW: 



RO = Reserved Operand 



Exceptions : 

Reserved Operand; 



floating point 



Opcode s 



max.b Vk 
max.h Vk 
max.w Vk 
max . 1 Vk 
max.s Vk 
max.d Vk 



0111111001000 
0111111001001 
0111111001010 
0111111001011 
0111111010100 
0111111010101 



Max of 
Max of 
Max of 
Max of 
Max of 
Max of 



vector of bytes 
vector of halfwords 
vector of words 
vector of longwords 
vector of single float 
vector of double float 



Description : 

The maximum of Sk and all the elements of Vk is determined. The 
number of elements searched is determined by VL, and the maximum 
element is loaded into Sk. 



Notes : 



1 The scalar register should be initialized to the minimum 
value for the first use of the max instruction. 

2 Since this instruction reduces a Vk and Sk into Sk, the 
assembler will accept either Vk or Sk as the argument to 
this instruction. 
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MIN VECTOR 



man. (b|h|w|l |s|d) Vk 



Purpose t 





To 


find 


the 


mx 


nimum element 


of a 


vector 


Format: 


















1 


Opcode 




| Vk/Sk | 






15 








3,2 




Operation: 
DO 


10 


a = 0, (VL-1) 
IF (Vk(a) .LT. Sk) ' 
Sk = Vk (a) 

ENDIF 


THEN 





10 CONTINUE 



PSW: 



RO = Reserved Operand 



Exceptions : 

Reserved Operand; 

Opcode : 



floating point only 



min.b 


Vk 


0111111001100 


Min 


of 


a 


min.h 


Vk 


0111111001101 


Min 


of 


a 


min.w 


Vk 


0111111001110 


Min 


of 


a 


min.l 


Vk 


0111111001111 


Min 


of 


a 


min.s 


Vk 


0111111010110 


Min 


of 


a 


min.d 


Vk 


0111111010111 


Min 


of 


a 



vector of bytes 
vector of half words 
vector of words 
vector of longwords 
vector of single float 
vector of double float 



Description : 

The minimum of Sk and all the elements of Vk is determined. The 
number of elements searched is determined by VL, and the minimum 
element is loaded into Sk. 



Notes : 



1 The scalar register should be initialized to the maximum 
value for the first use of the MIN instruction. 

2 Since this instruction reduces Vk and Sk into Sk, the 
assembler will accept either a Vk or Sk as the argument 
to this instruction . 
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AND REDUCE VECTOR all Vk 



Purpose : 

To AND reduce all the elements of a vector. 

Format: 

| Opcode | Vk/Sk | 

15 3,2 

Operation: 

DO 10 a = 0, (VL-1) 

Sk = Sk .AND. Vk(a) 
10 CONTINUE 

PSW: 

Exceptions : 

Opcode : 

all Vk 0111111000100 AND reduce a vector 

Description : 

The AND of the scalar register Sk and all elements of Vk is calcu- 
lated. The number of elements ANDed is determined by VL. The 
result is loaded into Sk. If all the corresponding bits in Sk and 
Vk are a 1, then a 1 is loaded in the corresponding bit position of 
the result Sk. 



Notes : 



1 The scalar register should be initialized to one for the 
first use of the AND reduce instruction. 

2 All 64-bits of each element participate in the reduction 
operation. 

3 Since this instruction reduces Vk and Sk into Sk, the 
assembler will accept either a Vk or Sk as the argument 
to this instruction. 
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OR REDUCE VECTOR any Vk 



Purpose : 

To OR reduce all the elements of a vector. 



Format : 






1 


Opcode 


| Vk/Sk | 


15 

Operation : 
DO 

10 


10 a = 0, 
Sk = 

CONTINUE 


3,2 

(VL-1) 

Sk .OR. Vk(a) 



PSW: 

Exceptions : 

Opcode : 

any Vk 0111111000101 OR reduce a vector 

Description : 

The OR of the scalar register Sk and all elements of Vk is calcu- 
lated . The number of elements ORed is determined by VL. The 
result is loaded into Sk. If any of the corresponding bits in Sk or 
Vk(i) is a 1, then a 1 is loaded into the corresponding bit posi- 
tion in the result loaded into Sk. 



Notes : 



1 The scalar register should be initialized to zero for 
the first use of the ANY instruction. 

2 All 64-bits of each element participate in the reduc- 
tion. 

3 Since this instruction reduces Vk and Sk into Sk, the 
assembler will accept either a Vk or Sk as the argument 
to this instruction . 
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EXCLUSIVE OR REDUCE VECTOR parity Vk 



Purpose : 

To EXCLUSIVE OR reduce all the elements of a vector. 

Format: 

| Opcode | Vk/Sk | 

15 3,2 

Operation : 

DO 10 a = 0, (VL-1) 

Sk = Sk .Exclusive OR. Vk (a) 
10 CONTINUE 

PSW: 

Exceptions : 

Opcode : 

parity Vk 0111111000110 Exclusive OR reduce a vector 

Description : 

The exclusive OR of the scalar register Sk and all elements of Vk 
is calculated. The number of elements exclusively ORed is deter- 
mined by VL. The result is loaded into Sk. 



Notes ; 



1 The scalar register should be initialized to zero for 
the first use of the reduction instruction. 

2 All 64 bits of each vector element participate in the 
operation. 

3 Since this instruction reduces Vk and Sk into Sk, the 
assembler will accept either a Vk or Sk as the argument 
to this instruction. 
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POPULATION COUNT VECTOR plc.t Vj,Vk 



Purpose: 

To count the number of one's in each vector element. 

Format : 

| Opcode | Vj | Vk | 

15 6,5 3,2 

Operation : 

DO 10 a = 0, (VL-1) 
Vk(a) =0 
DO 10 j = 0, 63 

IF (Vj<b> .EQ. 1) THEN 

Vk(a) = Vk(a) +1 
ENDIF 
10 CONTINUE 

PSW: 

Exceptions : 

Opcode : 

plc.t Vj,Vk 0110001101 Population Count of a Vector 

Description : 

The number of l's in each vector element of Vj is loaded in the 
corresponding element of Vk. The number loaded is zero extended on 
the left. The number of vector elements is equal to VL. 



Notes : 



The parity of the population count of each element can be calcu- 
lated by performing a population count and then by performing a 
vector and Vi,Sj,Vk with the scalar register loaded with a 1 in the 
least significant bit, and O's elsewhere. 
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CHAPTER 16 
16 YL, Y3, and Yfci Instnictifin Set 



This chapter is divided into two sections: VL and VS operators; and VM 
Operations . 



16.1 YL and YS 



Included in this section are the following instructions: Move Address/VL; 
Load VL/Immediate; Move Address/VS; Load VS Immediate; Load VS and VL; 
Store VS and VL; Move Scalar /VL, and Move Scalar/VS. None of the flags in 
the PSW are affected by these operations. 



16.2 



The VM operations include the following: Load VM; Store VM; Population 
Count VM, and Move VM/Scalar. 
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MOVE ADDRESS/VL mov Ak,VL 



Purpose : 

To move the contents of an address register to the vector length 
register, VL. 

Format : 

I Opcode | Ak | 

IS 3,2 

Operation : 

IF (Ak .GE. 128) THEN ; mov Ak,VL 

VL = 128 
ELSE 

IF (Ak .LT. 0) THEN 

VL = 
ELSE 

VL = Ak<6 . . 0> 
END IF 
END IF 

Ak = VL ! mov VL,Ak 

PSW: 

Exceptions : 

Opcode : 

mov Ak,VL 0111110110011 Move Ak to VL 
mov VL,Ak OlllllOllOOlO Move VL to Ak 

Description : 

The contents of the A register are moved to VL. If the contents of 
Ak are greater than 128, then 128 is loaded into VL. If the con- 
tents of Ak are less than 128, then Ak is loaded into VL. When VL 
is moved to Ak, Ak<31..7> are loaded with 0. 

Notes : 
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LOAD VL/IMMEDIATE ld.w #N,VL 



Purpose : 

To load the vector length register with an immediate 

Format: 

| Opcode |L| 000 | Ak | | N | 

15, 6,5 3,2 31|16 

Operation : 

IE (Immediate .GE. 128) THEN 

VL = 128 
ELSEIF (Aj .LT. 0) THEN 
VL = 
ELSE 

VL = Immediate<6. .0> 
END IF 
END IF 

PSW: 

Exceptions : 

Opcode : 

ld.w #N,VL 000110000 Load VL with an immediate 

Description : 

The immediate field is used to load VL. If the immediate field is 
greater than 128 then 128 is loaded into VL. If the immediate is 
less than 128 then immediate<6. .0> is loaded into VL. 

Notes : 
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VL, VS, and VM Instruction Set 

MOVE ADDRESS/VS mov ^^ 



Purpose : 

To move the contents of an address register to/from the vector 
stride register, VS. 



Format: 




| Opcode | Ak 


15 3,2 





Operation : 




VS = Ak ! mov Ak,VS 




Ak = VS ! mov VS,Ak 




PSW: 




Exceptions: 




Opcode: 




mov VS,Ak 01111101K 


D0< 


mov Ak,VS 01111101K 


DOC 



Description: 

The contents of the A register are moved to/ from VS, 

Notes : 
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VL, VS, and VM Instruction Set 

LOAD VS/IMMEDIATE Id.w #N,VS 



Purpose : 

To load the vector stride register from an immediate 

Format: 

| Opcode |L| 000 | Ak | | N | 

15, 6,5 3,2 31 | 16 

Operation: 

VS = Immediate 

PSW: 

Exceptions : 

Opcode : 

ld.w #N,VS 000110001 Load VS from an immediate 



Description: 

The immediate field is loaded into VS, 

Notes : 
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VL, VS, and VM Instruction Set 
LOAD VS AND VL 



ld.l <effa>,VLS 



Purpose : 

To load the vector stride register and the vector length register 
from memory 

Format : 



I Opcode |@|L| Aj | Ak | | Displacement | 
15 8,7.6,5 3,2 (31,15) 

Operation : 

VS = c (Effective Address<63. . 31>) ! VS loaded from higher order 32 bit 
VL - c (Effective Address<31. .0>) ! VL loaded from lower order 32 bits 

PSW: 

Exceptions : 

Opcode : 

ld.l <effa>,VLS 000010100 Load VS and VL from memory 

Description : 

The 64 bit operand in memory is fetched. Bits<63..32> are loaded 
into the vector stride register, VS. Bits<31. .0> are loaded into 
the vector length register, VL. 

Notes : 

1 VL is unconditionally loaded with the exact contents of 
memory. 
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VL, VS, and VM Instruction Set 

STORE VS AND VL st.l VLS, <effa> 



Purpose s 

To store the vector stride register and the vector length register 
to memory 

Format: 

| Opcode |@|L| Aj | Ak | | Displacement | 

15 8,7,6,5 3,2 (31,15) 

Operation: 

c (Effective Address<63. .31>) = VS ! VS stored into higher order 32 bits 
c (Effective Address<31. .0>) = VL ! VL stored into lower order 32 bits 

PSW: 

Exceptions : 

Opcode : 

st.l VLS,<effa> 000011100 Store VS and VL to memory 



Description: 

A 64-bit operand is stored in memory. Bits<63..32> are stored 
from the vector stride register, VS. Bits<31..0> are stored from 
the vector length register, VL. 

Notes : 
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VL, VS, and VM Instruction Set 
MOVE SCALAR/VL 



mov.w Sk / VL 



Purpose : 

To move the contents of Sk to VL . 

Format: 

I Opcode | Ak | 

15 3,2 

Operation: 

IF (Sk .GE 128) THEN ! mov.w Sk VL 

VL = 128 
ELSE 

IF (Sk .LT. 0) THEN 

VL = 
ELSE 

VL = Sk<6..0> 
ENDIF 
ENDIF 

PSW: 

Opcode : 

mov.w Sk,VL 0111110110111 Move Sk to VL 

Description : 

The least significant 32 bits of Sk (Sk<31..0>) are moved to VL. 
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VL, VS, and VM Instruction Set 

MOVE SCALAR/VS mov.w Sk,VS 



Purpose : 

To move the contents of Sk to VS . 



Format: 






| Opcode 


| Ak 


1 


15 


3,2 





Operation: 






VS = Sk ! mov.w, 


Sk, VS 




PSW: 






Opcode : 







mov.w Sk,VS 0111110110101 Move Sk to VS 

Description: 

The least significant 32 bits of Sk (Sk<31..0>) are moved to VS. 
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VL, VS, and VM Instruction Set 
LOAD VM 



ld.x <effa>, VM 



Purpose: 

To load the VM register from memory 

Format : 

| Opcode |@|L| Aj | Ak | | Displacement | 
15 8,7,6,5 3,2 (31,15) 



Operation: 



VM = c (Effective Address<127..0>) ! Load 128 bits beginning at 
<effa> 

PSW: 

Exceptions : 

Opcode : 

ld.x <effa>, VM 000010110 Load VM from memory 

Description : 

The 128 bits (16 bytes) , beginning at the effective address are 
loaded into VM. 

Notes : 

1 VM <127..120> are loaded from the byte referenced by the 
effective address. VM <7..0> are loaded from the byte 
referenced by effective address + 15. 
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VL, VS, and VM Instruction Set 

STORE VM st.x VM,<effa> 



Purpose: 

To store the VM register into memory 

Format : 

| Opcode |@|L| Aj | Ak | | Displacement | 

15 8,7,6,5 3,2 (31,15) 

Operation : 

c (Effective Address<127. .0>) = VM ! Store 128 bits beginning at 
Effective Address. 

PSW: 

Exceptions : 

Opcode : 

st.x VM, <effa> 000011110 store VM into memory- 



Description : 

VM is stored into the 128 bits (16 bytes) , beginning at the effec- 
tive address. 

Notes : 

1 VM <127..120> are stored in the byte referenced by the 
effective address. VM <7..0> are stored in the byte 
referenced by the effective address + 15. 
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VL, VS, and VM Instruction Set 
POPULATION COUNT VM 



pic. (t|f) VM,Sk 



Purpose: 



To load the number of ones or zeros in VM into an S register. 



Format ! 



Opcode 



15 
Operation: 



I Sk J 
3,2 



IF (PLC.T) THEN 

DO 10 a = 0, (VL-1) 

Sk = Sk + VM<a> 
10 CONTINUE 
ELSE 

DO 10 a = 0, (VL-1) 

Sk = Sk + .NOT. VM<a> 
10 CONTINUE 
ENDIF 



Exceptions : 



Opcode: 



plc.f VM,Sk 
plc.t VM,Sk 



0111111011100 
0111111011101 



Load the number of O's in VM into Sk 
Load the number of l*s in VM into Sk 



Description: 

The number of l's or O's in VM until VL is loaded into Sk, bits 
6..0. All other Sk bits are reset to 0. 



Notes : 



These instructions typically determine the number of successful 
compare operations performed. The VM set by the compare could also 
be used to compress a vector register, and then further processing 
would use a VL value determined by the pic VM instructions. 
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VL, VS, and VM Instruction Set 
MOVE VM/SCALAB. 



mov Sj,VM,Sk 



Purpose s 



To move VM to and from a scalar register. 



Format : 



| Opcode 
15 



| Sj | Sk | 
6.5 3,2 



Operation : 

Sk = VM(Sj) ! use Sj<0> to index VM<127..64> or VM <63..0> 
VM(Sj) = Sk ! move Sk to VM 

PSW: 

Exceptions : 

Opcode : 

mov Sj,Sk,VM 0110000100 Load VM(Sj) from Sk. 
mov Sj,VM,Sk 0110000101 Load Sk from VM 

1 

Description: 

The contents of Sj determine which part of VM is manipulated. If 
the least significant bit in Sj is a 0, VM<63..0> are manipulated. 
If Sj is a 1, VM<127..63> are manipulated. Sk is moved to and from 
VM(Sj) . Bits<63..1> of Sj are ignored. 



Notes ; 



Typically, a move of VM into the Sk scalar register is used to 
determine all of the elements that satisfy a multi-value logical 
relation. These logical relations may involve zero crossing algo- 
rithms or all elements that are between several boundary conditions 
(greater than x and less than y) . 
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APPENDIX A 
NOTATIONAL CONVENTIONS 

This text utilizes the notational conventions listed below: 



o Bit numbering is right to left, through N-l. The most signifi- 
cant numerical bit is N-l, the least significant 0. In essence, 
the bit numbering represents the binary weight of a position. 

o Bit fields are specified using the following convention: 

REG<15. .0> 

where the bit field is REG from bits 15 through 0. 
o Individual bit positions within a register are denoted by 

specific positions separated by commas. For example, 

REG<15,4,0> denotes bits 15, 4, and of REG. 
o Byte numbering is from left to right, 
o A bit is a single binary value or entity, 
o A byte is 8 bits, 
o A halfword is 16 bits, 
o A word is 32 bits, 
o A longword is 64 bits. 

o Single precision is a 32 bit floating point word, 
o Double precision is a 64 bit floating point longword. 
o An instruction is a multi -halfword operand, 
o A register is a programmer visible hardware storage element 

internal to the processor, 
o Main memory or physical memory is the physical storage present 

in the computer system. 
o Logical or virtual memory or memory is the perceived amount of 

main memory as seen by the application programmer. 
b^The* symbol K is an abbreviation for 1,024. 
o T$te>' symbol M is an abbreviation for 1,048,576. 
o The symbol G is an abbreviation for 1,073,741,824. 
o TBD means to be determined, 
o A 'stack is a linked-list group of words. A stack is useful for 

dynamic allocation and deallocation of memory, 
o A return Block is a collection of registers that is pushed or 

popped from a stack in response to an instruction or other 

event . 



Where used in the document, the terms "reserved" or "undefined" are meant 
to convey to the hardware and software engineer what to expect, if any- 
thing, from unused fields in registers. The programming of algorithms 
which are based on the use of undefined or reserved fields is not recom- 
mended . 

Wherever feasible, the FORTRAN language (F'77 and 8x) will be used as a 
metalanguage to describe algorithms. All of the proper FORTRAN syntax and 
semantics will be used. For example, the symbol "G" as defined above is 
represented by 2**30 in FORTRAN. Comments on a line are indicated with a 
" ; " preceding the comment . 
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APPENDIX B 
OP CODES SORTED BY NUMBER 



OxOOOO 

0x0100 

0x0200 

0x0300 

0x0400 

0x0500 

0x0600 

0x0700 

0x0900 

OxOaOO 

OxObOO 

OxOcOO 

OxOdOO 

OxOeOO 

0x0 f 00 

0x1000 

0x1008 

0x1080 

0x1088 

0x1088 

0x1088 

0x1100 

Oxll08 

0x1180 

0x1188 

Oxll88 

0x1188 

0x1200 

0x1208 

0x1280 

0x1288 

0x1300 

0x1308 

0x1380 

0x1388 

0x1400 

0x1408 

0x1480 

0x1488 

0x1500 

0x1508 

0x1580 

0x1588 

0x1600 

0x1608 

0x1680 

0x1688 

0x1700 

0x1708 

0x1780 

0x1788 



11-7 

11-7 

11-7 

11-7 

11-7 

11-7 

11-7 

11-7 

9-26 

16-6 

16-10 

9-32 

9-27 

16-7 

16-11 

12-20 

10-6 

11-16 

10-6 

10-6 

10-6 

9-6 

10-6 

9-6 

10-6 

10-6 

10-6 

9-21 

10-21 

9-22 

10-22 

9-23 

10-23 

9-25 

10-24 

9-12 

10-17 

9-12 

10-17 

9-13 

10-18 

9-13 

10-18 

9-14 

10-19 

9-14 

10-19 

9-15 

10-20 

9-15 

10-20 



exit 

jmp <effa> 
jmpi. f <ef fa> 
jropi.t <effa> 
jmpa.f <effa> 
jmpa.t <effa> 
jmps.f <effa> 
jmps.t <effa> 
Idea <effa>,Ak 
ld.l <effa>,VLS 
ld.x <effa>, VM 
tas <effa> 
pshea <effa> 
st.l VLS,<effa> 
st.x VM, <effa> 
halt #N,Ak 
ld.d #N,Sk 
sysc #r,#g 
ld.du #N,Sk 
ld.lu #N,Sk 
ld.u #N,Sk 
ld.h #N,Ak 
ld.l #N,Sk 
ld.w #N,Ak 
ld.dl #N,Sk 
Id. 11 #N,Sk 
ld.w #N, Sk 
and #N,Ak 
and #N,Sk 
or #N,Ak 
or #N,Sk 
xor #N,AK 
xor #N,Sk 
shf #N,AK 
shf #N,Sk 
add.h #N,Ak 
add.h #N,Sk 
add.w #N,Ak 
add.w #N,Sk 
sub.h #N,Ak 
sub.h #N,Sk 
sub.w #N,Ak 



sub, 

mul . 

mul 

mul 

mul 

div 



#N,Sk 
#N,Ak 
#N,Sk 
#N,Ak 
#N,Sk 
#N,Ak 



, upper half 
upper half 



dlv.h #N,Sk 
div.w #N,Ak 
div.w #N,Sk 



Error Exit Instruction 

Jump Always 

Jump on ION false 

Jump on ION true 

Jump on address carry false 

Jump on address carry true 

Jump on scalar carry false 

Jump on scalar carry true 

Load effective address 

Load VS and VL from memory 

Load VM from memory 

Test and Set a memory byte 

Push effective address 

Store VS and VL to memory 

store VM into memory 

Halt the central processing unit 

Load immediate, most significant bits 

Perform a system call 

Load 64 bit floating immed. 

Load 64 bit integer immed. , 

Load immediate, upper half 

Load half word imm. into Ak 

Load 64 bit sign extended immediate 

Load imm. into Ak 

Load 64 bit floating immed., lower half 

Load 64 bit integer immed., lower half 

Load a 32 bit immediate 

AND imm. to addr. reg. 

AND scalar/immediate 

OR imm. to addr. reg. 

OR scalar/immediate 

Exclusive OR imm. to addr. reg. 

Exclusive OR scalar/immediate 

Logical shift imm. to addr. reg. 

Shift Scalar/immediate 

Add imm. address half word 

Add scalar/immed. integer halfword 

Add imm. address word 

Add scalar/immed. "integer word 

Subtract imm. address halfword 

Subtract scalar/immed. integer halfword 

Subtract imm. address word 

Subtract scalar/immed. integer word 

Multiply imm. address halfword 

Multiply scalar/immed. integer halfword 

Multiply imm. address word 

Multiply scalar/immed. integer word 

Divide imm. address halfword 

Divide scalar /scalar integer halfword 

Divide imm. address word 

Divide scalar /scalar integer word 
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0x1800 16-3 ld.w #N,VL 

0x1808 10-17 add.s #N, Sk 

0x1880 16-5 ld.w #N,VS 

0x1888 10-18 sub.s #N, Sk 

0x1908 10-19 mul.s #N, Sk 

0x1988 10-20 div.s #N, Sk 

0xla08 10-31 le.s #N,Sk 

0xla88 10-31 lt.s #N,Sk 

OxlbOO 9-41 eq.h #N,Ak 

Oxlb08 10-31 eq.h #N,Sk 

Oxlb80 9-41 eq.w #N,Ak 

Oxlb88 10-31 eq.w #N,Sk 

OxlcOO 9-42 leu.h #N,Ak 

0xlc08 10-33 leu.h #N, Sk 

Oxlc80 9-42 leu.w #N,Ak 

0xlc88 10-33 leu.w #N,Sk 

OxldOO 9-42 ltu.h #N,Ak 

Oxld08 10-33 ltu.h #N,Sk 

0xld80 9-43 ltu.w #N,Ak 

0xld88 10-33 ltu.w #N, Sk 

OxleOO 9-41 le.h #N,Ak 

0xleO8 10-31 le.h #N, Sk 

OxleSO 9-41 le.w #N,Ak 

0xle88 10-31 le.w #N,Sk 

OxlfOO 9-41 lt.h #N,Alc 

Oxl f 08 10-31 lt.h #N,Sk 

0xlf80 9-41 lt.w #N,Ak 

0xlf88 10-31 lt.w #N,Sk 

0x2000 11-10 call <effa> 

0x2100 11-11 calls <effa> 

0x2200 11-14 callq <effa> 

0x2400 13-18 ste.b Sk,<effa> 

0x2500 13-18 ste.h Sk,<effa> 

0x2600 13-18 ste.s Sk,<effa> 

0x2600 13-18 ste.w Sk,<effa> 

0x2700 13-18 ste.d Sk,<effa> 

0x2700 13-18 ste.l Sk,<effa> 

0x2800 9-4 ld.b <effa>,Ak 

0x2900 9-4 ld.h <effa>,Ak 

Ox2aOO 9-4 ld.w <effa>,Ak 

Ox2cOO 9-5 st.b Ak,<effa> 

Ox2dOO 9-5 st.h Ak,<effa> 

Ox2eOO 9-5 st.w Ak, <ef fa> 

0x3000 10-4 ld.b <effa> / Sk 

0x3100 10-4 ld.h <effa>,Sk 

0x3200 10-4 ld.s <effa>,Sk 

0x3200 10-4 ld.w <effa>,Sk 

0x3300 10-4 ld.d <effa>,Sk 

0x3300 10-4 ld.l <effa>,Sk 

0x3400 10-5 st.b Sk,<effa> 

0x3500 10-5 st.h Sk, <ef fa> 

0x3600 10-5 st.s Sk,<effa> 

0x3600 10-5 st.w Sk,<effa> 

0x3700 10-5 st.d Sk,<effa> 



Load VL with an immediate 

Add scalar/immed. single float 

Load VS from an immediate 

Subtract scalar/immed. single float 

Multiply scalar/immed. single float 

Divide scalar/scalar single float 

Compare less than or equal single 

Compare less than single 

Compare equal halfword 

Compare equal halfword 

Compare equal word 

Compare equal word 

Compare unsigned less than halfword 

Compare unsigned less than or equal halfword 

Compare unsigned less than word 

Compare unsigned less than or equal word 

Compare unsigned less than halfword 

Compare unsigned less than halfword 

Compare unsigned less than word 

Compare unsigned less than word 

Compare less than or equal halfword 

Compare less than or equal halfword 

Compare less than or equal word 

Compare less than or equal word 

Compare less than halfword 

Compare less than halfword 

Compare less than word 

Compare less than word 

Call a subroutine, make a long frame 

Call a subroutine, make a short frame 

Push the program counter and jump 
Store an extended scalar byte 
Store an extended scalar halfword 
Store an extended scalar single float 

Store an extended scalar word 

Store an extended scalar double float 

Store an extended scalar longword 

Load addr. reg. byte 

Load addr. reg. halfword 

Load addr. reg. word 

Store addr. reg. byte 

Store addr. reg. halfword 

Store addr . reg. word 

Load scalar byte 

Load scalar halfword 

Load scalar single float 

Load scalar word 

Load scalar double float 

Load scalar longword 

Store scalar byte 

Store scalar halfword 

Store scalar single float 

Store scalar word 

Store scalar double float 
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0x3700 


10-5 


0x3800 


13-8 


0x3900 


13-8 


0x3aO0 


13-8 


0x3a00 


13-8 


0x3b00 


13-8 


0x3b00 


13-8 


0x3c00 


13-10 


0x3d00 


13-10 


0x3e00 


13-10 


0x3e00 


13-10 


0x3 f 00 


13-10 


0x3 f 00 


13-10 


0x4000 


9-44 


0x4040 


9-44 


0x4080 


9-44 


0x40c0 


9-44 


0x4100 


10-34 


0x4140 


10-34 


0x4180 


10-34 


0x41c0 


10-34 


0x4200 


10-34 


0x4240 


10-34 


0x4280 


10-34 


0x42c0 


10-34 


0x4300 


10-34 


0x4340 


10-34 


0x4380 


10-34 


0x43c0 


10-34 


0x4400 


9-33 


0x4440 


9-25 


0x4480 


9-6 


0x44c0 


9-6 


0x4500 


10-34 


0x4540 


10-34 


0x4580 


10-38 


0x45c0 


10-37 


0x4600 


9-37 


0x4640 


9-37 


0x4680 


9-40 


Ox46cO 


9-40 


0x4700 


10-27 


0x4740 


10-27 


0x4780 


10-27 


0x47c0 


10-27 


0x4800 


9-39 


0x4840 


9-39 


0x4880 


9-42 


0x48c0 


9-42 


0x4900 


10-29 


0x4940 


10-29 


0x4980 


10-29 


0x49c0 


10-29 


0x4a00 


9-39 
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st.l Sk,<effa> 
ld.b <effa>,Vk 
ld.h <effa>,Vk 
ld.s <effa>,Vk 
ld.w <effa>,Vk 
ld.d <effa>,Vk 
ld.l <effa> / Vk 
st.b Vk,<effa> 
st.h Vk,<effa> 
st.s Vk,<effa> 
st.w Vk,<effa> 
st.d Vk,<effa> 
st.l Vk,<effa> 
cvtw.b Aj,Ak 
cvtw.h Aj ,Ak 
cvtb.w Aj,Ak 
cvth.w Aj,AK 
cvtw.b Sj,Sk 
cvtw.h Sj,Sk 
cvtb.w Sj,Sk 
cvth.w Sj,Sk 
cvtw . s S j , Sk 
cvts.w Sj,Sk 
cvtd.s Sj,Sk 
cvts.d Sj,Sk 
cvts.l Sj,Sk 
cvtd.l Sj,Sk 
cvtl.S4Sj,Sk 
cvtl.d Sj,Sk 
ldpa Aj,Ak 
shf #n,Ak 
ld.h #n,Ak 
ld.w #n,Ak 
cvtl.w Sj,Sk 
cvtw.l Sj,Sk 
plc.t Sj,Sk 
tzc Sj,Sk 
eq.h Aj ,Ak 
eq.w Aj ,Ak 
eq.h #n,Ak 
eq.w #n,Ak 
eq.b Sj,Sk 
eq.h Sj,Sk 
eq.w Sj,Sk 
eq.l Sj,Sk 
leu.h Aj,Ak 
leu.w Aj,Ak 
leu.h #n,Ak 
leu.w #n,Ak 
leu.b Sj,Sk 
leu.h Sj,Sk 
leu.w Sj,Sk 
leu.l Sj,Sk 
ltu.h gt,Aj,Ak 



Store scalar longword 

Load vector byte 

Load vector halfword 

Load vector single float 

Load vector word 

Load vector double float 

Load vector longword 

Store vector byte 

Store vector halfword 

Store vector single float 

Store vector word 

Store vector double float 

Store vector longword 

Convert word to byte 

Convert word to halfword 

Convert byte to word 

Convert half to word 

Convert word to byte 

Convert word to halfword 

Convert byte to word 

Convert halfword to word 

Convert word to single float 

Convert single float to word 

Convert double float to single float 

Convert single float to double float 

Convert single float to longword 

Convert double float to longword 

Convert longword to single float 

Convert longword to double float 

Load a physical byte address into Ak 

Logical shift left short imm. 

Load short imm. into Ak 

Load short imm. into Ak 

Convert longword to word 

Convert word to longword 

Count the number of 1 ' s in S j 

Count of trailing zeroes in Sj 

Compare equal halfword 

Compare equal word 

Compare equal halfword 

Compare equal word 

Compare equal byte 

Compare equal halfword 

Compare equal word 

Compare equal longword 

Compare unsigned less than or equal halfword 

Compare unsigned less or equal than word 

Compare unsigned less than or equal halfword 

Compare unsigned less than or equal word 

Compare less than or equal byte 

Compare less than or equal halfword 

Compare less than or equal word 

Compare less than or equal longword 

Compare unsigned less than halfword 
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Ox4a40 

0x4a80 

0x4ac0 

0x4bO0 

Ox4b40 

0x4b80 

Ox4bcO 

0x4c00 

Ox4c40 

0x4c80 

0x4cc0 

0x4d00 

0x4d40 

0x4d80 

0x4dc0 

0x4e00 

0x4e40 

0x4e80 

0x4ec0 

0x4 f 00 

0x4f40 

0x4 f 80 

0x4 fcO 

0x5000 

0x5040 

0x5080 

0x50c0 

0x510O 

0x5100 

0x5140 

0x5180 

0x5180 

0x51c0 

0x5200 

0x5240 

0x5280 

0x52c0 

0x5300 

0x5340 

0x5380 

Ox53cO 

0x5400 

0x5440 

0x5480 

0x54c0 

0x5500 

0x5540 

0x5580 

0x55c0 

0x5600 

0x5640 

0x5680 

Ox56cO 

0x5700 



9-39 

9-42 

9-42 

10-29 

10-29 

10-29 

10-29 

9-37 

9-37 

9-40 

9-40 

10-27 

10-27 

10-27 

10-27 

9-37 

9-37 

9-40 

9-40 

10-27 

10-27 

10-27 

10-27 

9-16 

9-24 

9-29 

10-39 

10-41 

10-41 

10-36 

10-41 

10-41 

10-40 

9-17 

9-18 

9-19 

9-20 

10-13 

10-14 

10-15 

10-16 

10-27 

10-27 

10-27 

10-27 

10-8 

10-8 

10-9 

10-9 

10-27 

10-28 

9-11 

9-11 

10-10 



ltu.w 
ltu.h 



gt,Aj. 
#n,Ak 



le.w 
le.l 
lt.h 
lt.v 



It.b 
lt.h 



ltu.w #n,Ak 
ltu.b Sj,Sk 
ltu.h Sj,Sk 
ltu.w Sj,Sk 
ltu.l Sj,Sk 
le.h Aj,Ak 
le.w Aj,AK 
le.h #n,Ak 
le.w #n,Ak 
le.b Sj,Sk 
le.h Sj,Sk 
Sj,Sk 
Sj,Sk 
Aj,Ak 
Aj,Ak 
lt.h #n,Ak 
lt.w #n,Ak 
Sj,Sk 
Sj,Sk 
lt.w Sj,Sk 
lt.l Sj,Sk 
add.w Sj,Ak 
shf Aj,Ak 
iov Aj,Ak 
mov Sj,Ak 
mov.s Sj,Sk 
mov.w Sj,Sk 
shf Sj,Sk 
mov.d Sj,Sk 
mov.l Sj,Sk 
mov Aj , Sk 
and A j , Ak 
or Aj,Ak 
xor Aj , Ak 
not Aj,Ak 
and Sj,Sk 
or Sj,Sk 
xor S j , Sk 
not Sj,Sk 
le.s Sj,Sk 
le.d Sj,Sk 
lt.s Sj,Sk 
lt.d Sj,Sk 
add.s Sj,Sk 
add.d Sj,Sk 
sub.s S j , Sk 
sub.d Sj,Sk 
eq.s Sj,Sk 
eq.d Sj,Sk 
neg.h Aj,Ak 
neg.w Aj,Ak 
mul . s S j . Sk 
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Ak Compare unsigned less than word 

Compare unsigned less than half word 

Compare unsigned less than word 

Compare less than byte 

Compare less than half word 

Compare less than word 

Compare less than longword 

Compare less than or equal signed half word 

Compare less than or equal signed word 

Compare less than or equal half word 

Compare less than or equal word 

Compare less than or equal byte 

Compare less than or equal halfword 

Compare less than or equal word 

Compare less than or equal longword 

Compare less than signed halfword 

Compare less than signed word 

Compare less than halfword 

Compare less than word 

Compare less than byte 

Compare less than halfword 

Compare less than word 

Compare less than longword 

Add scalar to addr word 

Shift an address 

Move addr. reg. 

Move 32 bits of Sj into Ak. 

Move scalar register double float 

Move scalar register word 

Shift a scalar 

Move scalar register single float 

Move scalar register longword 

Move an address to a scalar 

AND addr. reg. 

OR addr. reg. 

Exclusive OR addr. reg. 

Complement addr. reg. 

AND scalar/scalar 

OR scalar/scalar 

Exclusive OR scalar/scalar 

Complement scalar/scalar 

Compare less than or equal single float 

Compare less than or equal double float 

Compare less than single float 

Compare less than double float 

Add scalar/scalar single float 

Add scalar/scalar double float 

Subtract scalar/scalar single float 

Subtract scalar /scalar double float 

Compare equal single float 

Compare equal double float 

Negate addr. reg. halfword 

Negate addr. reg. word 

Multiply scalar/scalar single float 
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0x5740 10-10 


mul . d 


Sj,Sk 


0x5780 10-11 


div.s 


Sj,Sk 


0x57c0 10-11 


div.d 


Sj,Sk 


0x5800 9-7 


add.h 


Aj,Ak 


0x5840 9-7 


add.w 


Aj,Ak 


0x5880 9-12 


add.h 


#n,Ak 


0x58c0 9-12 


add.w 


#n,Ak 


0x5900 10-8 


add.b 


Sj,Sk 


0x5940 10-8 


add.h 


Sj,Sk 


0x5980 10-8 


add.w 


Sj,Sk 


Ox59cO 10-8 


add.l 


Sj,Sk 


0x5a00 9-8 


sub.h 


Aj,Ak 


0x5a40 9-8 


sub.w 


Aj,Ak 


0x5a80 9-13 


sub.h 


#n,Ak 


0x5ac0 9-13 


sub.w 


#n,AK 


0x5b00 10-9 


sub.b 


Sj,Sk 


0x5b40 10-9 


sub.h 


Sj,Sk 


0x5b80 10-9 


sub.w 


Sj,Sk 


0x5bc0 10-9 


sub. 1 


Sj,Sk 


0x5c00 9-9 


mul .h 


Aj,Ak 


0x5c40 9-9 


mul . w 


Aj,AK 


0x5c80 9-14 


mul .h 


#n,Ak 


0x5cc0 9-14 


mul.w 


#n,Ak 


0x5d00 10-10 


mul.b 


Sj,Sk 


0x5d40 10-10 


mul .h 


Sj,Sk 


0x5d80 10-10 


mul.w 


Sj,Sk 


0x5dc0 10-10 


mul. 1 


Sj,Sk 


0x5e00 9-10 


dlv.h 


Aj,Ak 


0x5e40 9-10 


dlv.w 


Aj.Ak 


0x5e80 9-15 


div.h 


#n,Ak 


0x5ec0 9-15 


div.w 


#n,Ak 


0x5 f 00 10-11 


div.b 


Sj , Sk 


0x5 f 40 10-11 


div.h 


Sj,Sk 


0x5 f 80 10-11 


div.w 


Sj,Sk 


0x5 fcO 10-11 


div.l 


Sj,Sk 


0x6100 16-13 


mov Sj,Sk,VM 


0x6140 16-13 


mov Sj,VM,Sk 


0x6 2cO 13-34 


not Vj,Vk 


0x6300 13-38 


shf Sj.Vk 


0x6340 15-11 


plc.t 


Vj,Vk 


0x6380 14-8 


cprs. 


f Vj,Vk 


Ox63cO 14-8 


cprs. 


t Vj,Vk 


0x6400 14-4 


eq.s 


Vj,Vk 


0x6440 14-5 


eq.d 


Vj,Vk 


0x6480 13-25 


neg.s 


Vj,Vk 


0x64c0 13-25 


neg.d 


Vj,Vk 


0x6500 14-6 


eq.s 


Sj.Vk 


0x6540 14-7 


eq.d 


Sj,Vk 


0x6580 10-12 


neg.s 


Sj,Sk 


0x65c0 10-12 


neg.d 


Sj,Sk 


0x6600 14-4 


le.s 


Vj,Vk 


0x6640 14-4 


le.d 


Vj,Vk 


0x6680 14-4 


lt.s 


Vj,Vk 


Ox66cO 14-4 


lt.d 


Vj,Vk 
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Multiply scalar/scalar double float 

Divide scalar/scalar single float 

Divide scalar/scalar double float 

Add addr. reg. half word 

Add addr. reg. word 

Add short imm. address half word 

Add short imm. address word 

Add scalar/scalar integer byte 

Add scalar/scalar integer halfword 

Add scalar/scalar integer word 

Add scalar/scalar integer longword 

Subtract addr. reg. halfword 

Subtract addr. reg. word 

Subtract short imm. address halfword 

Subtract short imm. address word 

Subtract scalar /scalar integer byte 

Subtract scalar /scalar integer halfword 

Subtract scalar/scalar integer word 

Subtract scalar/scalar integer longword 

Multiply addr. reg. halfword 

Multiply addr. reg. word 

Multiply short imm. address halfword 

Multiply short imm. address word 

Multiply scalar/scalar integer byte 

Multiply scalar /scalar integer halfword 

Multiply scalar /scalar integer word 

Multiply scalar/scalar integer longword 

Divide addr. reg. halfword 

Divide addr. reg. word 

Divide short imm. address halfword 

Divide short imm. address word 

Divide scalar /scalar integer byte 

Divide scalar/scalar integer halfword 

Divide scalar/scalar integer word 

Divide scalar/scalar integer longword 

Load VM(Sj) from Sk. 

Load Sk from VM 

Complement a vector 

Shift a vector accumulator 

Population Count of a Vector 

Compress a vector using not VM 

Compress a vector using VM 

Compare equal single 

Compare equal double precision 

Negate vector/vector single float 

Negate vector/vector double float 

Compare equal single 

Compare equal double precision 

Negate scalar/scalar single float 

Negate scalar/scalar double float 

Compare less than or equal single 

Compare less than or equal double float 

Compare less than single 

Compare less than double float 
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0x6700 14-6 le.s Sj,Vk 
0x6740 14-6 le.d Sj,Vk 
0x6780 14-6 lt.s Sj,Vk 
0x67c0 14-6 lt'.d Sj,Vk 
0x6800 14-4 eq.b Vj,Vk 
0x6840 14-4 eq.h Vj,Vk 

0x6880 14-4 eq.w Vj,Vk 

0x68c0 14-4 eq.l Vj,Vk 

0x6900 14-6 eq.b Sj,Vk 

0x6940 14-6 eq.h Sj,Vk 

0x6980 14-6 eq.w Sj,Vk 

0x69c0 14-6 eq.l Sj,Vk 

0x6a00 14-4 le.b Vj,Vk 

0x6a40 14-4 le.h Vj,Vk 

0x6a80 14-4 le.w Vj,Vk 

0x6ac0 14-4 le.l Vj,Vk 

0x6b00 14-6 le.b Sj,Vk 

0x6b40 14-6 le.h Sj,Vk 

0x6b80 14-6 le.w Sj,Vk 

0x6bc0 14-6 le.l Sj,Vk 

0x6c00 14-4 lt.b Vj,Vk 

0x6c40 14-4 lt.h Vj,Vk 

0x6c80 14-4 lt.w Vj,Vk 

0x6cc0 14-4 lt.l Vj,Vk 

0x6d0O 14-6 lt.b Sj,Vk 

0x6d40 14-6 lt.h Sj,Vk 

0x6d80 14-6 lt.w Sj,Vk 

0x6dc0 14-6 lt.l Sj,Vk 

0x6e00 13-25 neg.b Vj,Vk 

0x6e40 13-25 neg.h Vj,Vk 

0x6e80 13-25 neg.w V j , Vk 

0x6ec0 13-25 neg.l Vj,Vk 

0x6f00 10-12 neg.b S j , Sk 

Ox6f40 10-12 neg.h S j , Sk 

0x6f80 10-12 neg.w S j , Sk 

0x6fc0 10-12 neg.l S j , Sk 

0x7000 11-5 nop 
0x7100 11-5 " br 

0x7200 11-5 bri.f 

0x7300 11-5 bri.t 

0x7400 11-5 bra.f 

0x7500 11-5 bra.t 

0x7600 11-5 brs.f 

0x770O 11-5 brs.t 

0x7800 13-12 ldvi.b Vj,Vk 

0x7840 13-12 ldvi.h Vj,Vk 

0x7880 13-12 ldvi.s Vj,Vk 

0x7880 13-12 ldvi.w Vj,Vk 

Ox78cO 13-12 ldvi.d Vj,Vk 

Ox78cO 13-12 ldvi.l Vj,Vk 

Ox7E70 12-23 mov Sk,W 

0x7E78 12-24 tstvv 

0x7a00 13-14 stvi.b Vk,Vj 

0x7a40 13-14 stvi.h Vk,Vj 



Compare less than or equal single 

Compare less than or equal double float 

Compare less than single 

Compare less than double float 

Compare equal byte 

Compare equal half word 

Compare equal word 

Compare equal longword 

Compare equal byte 

Compare equal halfword 

Compare equal word 

Compare equal longword 

Compare less than or equal byte 

Compare less than or equal halfword 

Compare less than or equal word 

Compare less than or equal longword 

Compare less than or equal byte 

Compare less than or equal halfword 

Compare less than or equal word 

Compare less than or equal longword 

Compare less than byte 

Compare less than halfword 

Compare less than word 

Compare less than longword 

Compare less than byte 

Compare less than halfword 

Compare less than word 

Compare less than longword 

Negate vector/vector integer byte 

Negate vector/vector integer halfword 

Negate vector/vector integer word 

Negate vector/vector integer longword 

Negate scalar/scalar integer byte 

Negate scalar/scalar integer halfword 

Negate scalar /scalar integer word 

Negate scalar/scalar integer longword 

No Operation 

Branch Always 

Branch on ION false 

Branch on ION true 

Branch on address carry false 

Branch on address carry true 

Branch on scalar carry false 

Branch on scalar carry true 

Index Load vector byte 

Index Load vector halfword 

Index Load vector single float 

Index Load vector word 

Index Load vector double float 

Index Load vector longword 

Move scalar to vector valid flag 

Test value of vector valid flag 

Index Store vector byte 

Index Store vector halfword 
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0x7a80 13-14 


stvl.s Vk,Vj 


0x7a80 13-14 


stvi.w Vk,Vj 


Ox7acO 13-14 


stvi.d Vk,Vj 


Ox7acO 13-14 


stvi.l Vk,Vj 


Ox7bOO 13-16 


stvi.b Sk,Vj 


0x7b40 13-16 


stvi.h Sk,Vj 


0x7b80 13-16 


stvi.s Sk,Vj 


Ox7b80 13-16 


stvi.w Sk,Vj 


Ox7bcO 13-16 


stvi.d Sk,Vj 


Ox7bcO 13-16 


stvi.l Sk,Vj 


Ox7cOO 12-5 


ldsdr Ak 


Ox7c08 12-6 


ldkdr Ak 


Ox7c20 12-8 


patu 


Ox7c28 12-9 


pate Ak 


Ox7c30 12-13 


pich 


0x7c38 12-14 


plch 


0x7c40 9-35 


mov PSW / Ak 


0x7c48 9-36 


mov Ak,psw 


0x7c50 9-28 


mov PC,Ak 


0x7c60 12-16 


mov ITR,Sk 


0x7c68 12-15 


mov Sk,ITR 


0x7c78 12-17 


mov Sk , itsr 


0x7c80 11-15 


rtnq 


0x7c90 11-12 


rtn 


Ox7ca8 12-10 


rtnc 


0x7ca8 16-9 


mov.w Sk,VS 


0x7cb8 16-8 


mov.w Sk,VL 


0x7d00 9-30 


psh.w Ak 


0x7dl0 9-31 


pop . w Ak 


0x7d20 10-25 


psh.w Sk 


0x7d28 10-25 


psh.l Sk 


0x7d30 10-26 


pop . w Sk 


0x7d38 10-26 


pop . 1 Sk 


0x7d40 12-12 


eni 


0x7d48 12-12 


dsi 


0x7d50 11-9 


bkpt 


0x7d60 12-19 


mski Sk 


0x7d68 12-18 


xmti Sk 


0x7d80 16-4 


mov VS,Ak 


0x7d88 16-4 


mov Ak,VS 


0x7d90 16-2 


mov VL,Ak 


0x7d98 16-2 


mov Ak,VL 


0x7dc0 12-21 


diag Ak 


0x7e00 15-2 


sum.b Vk 


0x7e08 15-2 


sum.h Vk 


0x7el0 15-2 


sum.w Vk 


0x7el8 15-2 


sum. 1 Vk 


0x7e20 15-8 


all Vk 


0x7e28 15-9 


any Vk 


0x7e30 15-10 


parity Vk 


0x7e40 15-6 


max.b Vk 


0x7e48 15-6 


max.h Vk 


0x7e50 15-6 


max.w Vk 


0x7e58 15-6 


max . 1 Vk 
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Index Store vector single float 

Index Store vector word 

Index Store vector double float 

Index Store vector longword 

Scalar Index Store vector byte 

Scalar Index Store vector halfword 

Scalar Index Store vector single float 

Scalar Index Store vector word 

Scalar Index Store vector double float 

Scalar Index Store vector longword 

Load process SDR ' s 

Load all 8 SDR's 

Purge the entire ATU 

Purge ATU entry 

Purge the instruction cache 

Purge the logical cache 

Store the PSW into an addr. reg. 

Load an addr. reg. into the PSW 

Load next PC address 

Move the itc,itsr,nitc into Sk 

Load NITC, ITC, ITSR from Sk 

Load ITSR with a scalar 

Pop the program counter and jump 

Return from subroutine call 

Return from a context block 

Move Sk to VS 

Move Sk to VL 

Push an addr. reg. 

Pop word into addr. reg. 

Push Sk<31. .0> onto the stack 

Push Sk<63 . . 0> onto the stack. 

Pop Sk<31 . . 0> from the stack 

Pop Sk<63..0> from the stack. 

Enable interrupts, set ion to 1 

Disable interrupts , reset ion to 

Breakpoint 

Mask Out Interrupt 

Transmit Interrupt 

Move VS to Ak 

Move Ak to VS 

Move VL to Ak 

Move Ak to VL 

Execute non-standard microcode sequence 

Sum a vector of bytes 

Sum a vector of ha If words 

Sum a vector o f words 

Sum a vector of longwords 

AND reduce a vector 

OR reduce a vector 

Exclusive OR reduce a vector 

Max of a vector of bytes 

Max of a vector of halfwords 

Max of a vector of words 

Max of a vector of longwords 
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0x7e60 

0x7e68 

Ox7e70 

0x7e78 

0x7e8O 

0x7e88 

Ox7e90 

0x7e98 

Ox7eaO 

Ox7ea8 

0x7eb0 

0x7eb8 

0x7ecO 

0x7ec8 

Ox7edO 

Ox7ed8 

Ox7eeO 

0x7ee8 

0x8000 

0x820O 

0x8400 

0x8600 

0x8800 

0x8a00 

0x8c00 

0x8e00 

0x900O 

0x9200 

0x9400 

0x9600 

0x9800 

0x9a00 

Ox9cOO 

0x9e00 

OxaOOO 

Oxa200 

0xa400 

Oxa800 

OxaaOO 

OxacOO 

OxbOOO 

Oxb200 

0xb400 

Oxb600 

Oxb800 

OxbaOO 

OxbcOO 

OxbeOO 

OxcOOO 

Oxc200 

0xc400 

Oxc600 

OxcSOO 

OxcaOO 



15-7 

15-7 

15-7 

15-7 

15-2 

15-2 

15-4 

15-4 

15-6 

15-6 

15-7 

15-7 

15-4 

15-4 

15-4 

15-4 

16-12 

16-12 

13-40 

13-39 

14-10 

14-13 

14-12 

14-14 

14-12 

14-14 

13-22 

13-22 

13-23 

13-23 

13-28 

13-28 

13-29 

13-29 

13-31 

13-32 

13-33 

13-35 

13-36 

13-37 

13-20 

13-20 

13-21 

13-21 

13-26 

13-26 

13-27 

13-27 

13-20 

13-20 

13-20 

13-20 

13-26 

13-26 



mm 
mln 



Vk 
Vk 



min.w Vk 
min . 1 Vk 
sum.s Vk 
sum.d Vk 
prod.s Vk 
prod.d Vk 
max.s Vk 
max.d Vk 
min.s Vk 
min.d Vk 
prod.b Vk 
prod.h Vk 
prod.w Vk 
prod.l Vk 
plc.f VM,Sk 
plc.t VM, Sk 
mov Vi,Sj,Sk 
mov Si,Sj,Vk 
merg.t Vi,Vj,Vk 
mask.t Vi,Vj,Vk 
merg.f Vi,Sj,Vk 
mask.f Vi,Sj,Vk 
merg.t Vi,Sj,Vk 
mask.t Vi,Sj,Vk 
mul.s Vi,Vj,Vk 
mul.d Vi,Vj,Vk 
div.s Vi,Vj,Vk 
div.d Vi,Vj,Vk 
mul.s Vi,Sj,Vk 
mul.d Vi,Sj,Vk 
div.s Vi,Sj,Vk 
div.d Vi.Sj,Vk 
and Vi,Vj,Vk 
or Vi,Vj,Vk 
xor Vi,Vj,Vk 
and Vi,Sj,Vk 
or Vi,Sj,Vk 
xor Vi,Sj,Vk 
add.s Vi,Vj,Vk 
add.d Vi,Vj,Vk 
sub.s Vl,Vj,Vk 
sub.d Vi,Vj,Vk 
add.s Vi,Sj,Vk 
add.d Vi,Sj,Vk 
sub.s Vi,Sj,Vk 
sub.d Vi,Sj,Vk 
add.b Vi,Vj,Vk 
add.h Vi,Vj,Vk 
add.w Vi,Vj,Vk 
add.l Vi,Vj,Vk 
add.b Vi,Sj,Vk 
add.h Vi Sj,Vk 



Sum 

Sum 



Multiply reduce 
Multiply reduce 
Multiply reduce 
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Min of a vector of bytes 
Min of a vector of ha If words 
Min of a vector of words 
Min of a vector of longwords 
a vector of single float 
a vector of double float 
Multiply reduce a vector of single float 
Multiply reduce a vector of double float 
Max of a vector of single float 
Max of a vector of double float 
Min of a vector of single float 
Min of a vector of double float 
Multiply reduce a vector of bytes 

a vector of ha If words 
a vector of words 
a vector of longwords 
Load the number of O's in VM into Sk 
Load the number of l*s in VM into Sk 
Move a vector element to a scalar 
Move a scalar to a vector element 
Merge vector/vector 
Mask vector/vector 
Merge vector/scalar using not VM 
Mask vector/scalar using not VM 
Merge vector/scalar 
Mask vector/scalar using VM 
Multiply vector/vector single float 
Multiply vector/vector double float 
Divide vector/vector single float 
Divide vector /vector double float 
Multiply vector/scalar single float 
Multiply vector/scalar double float 
Divide vector/scalar single float 
Divide vector/scalar double float 
AND two vectors 
OR two vectors 
Exclusive OR two vectors 
AND vector/scalar 
OR vector/scalar 
Exclusive OR vector/scalar 
Add vector/vector single float 
Add vector/vector double float 
Subtract vector/vector single float 
Subtract vector/vector double float 
Add vector/scalar single float 
Add vector/scalar double float 
Subtract vector/scalar single float 
Subtract vector/scalar double float 
Add vector/vector integer byte 
Add vector/vector integer hal fword 
Add vector/vector integer word 
Add vector/vector integer longword 
Add vector/scalar integer byte 
Add vector/scalar integer hal fword 
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OxccOO 


13- 


•26 


add. w 


Vi,Sj,Vk 


OxceOO 


13- 


26 


add.l 


Vi,Sj,Vk 


OxdOOO 


13- 


•21 


sub.b 


Vi,Vj,Vk 


Oxd200 


13- 


•21 


sub.h 


Vi.Vj^Vk 


0xd400 


13- 


•21 


sub. w 


Vi,Vj,Vk 


Oxd600 


13- 


•21 


sub. 1 


Vi,Vj,Vk 


Oxd800 


13- 


-27 


sub.b 


Vi,Sj,Vk 


OxdaOO 


13- 


-27 


sub . h 


Vi / Sj,Vk 


OxdcOO 


13- 


•27 


sub. w 


Vi,Sj / Vk 


OxdeOO 


13- 


•27 


sub. 1 


Vi.Sj,Vk 


OxeOOO 


13- 


•22 


mul.b 


Vi,Vj / Vk 


Oxe200 


13- 


•22 


mul .h 


Vi,Vj,Vk 


0xe400 


13- 


-22 


mul . w 


Vi,Vj,Vk 


Oxe600 


13- 


-22 


mul . 1 


Vi / Vj,Vk 


Oxe800 


13- 


-28 


mul .b 


Vi.Sj,Vk 


OxeaOO 


13- 


•28 


mul .h 


Vi,Sj,Vk 


OxecOO 


13- 


-28 


mul .w 


Vi,Sj,Vk 


OxeeOO 


13- 


-28 


mul. 1 


Vi,Sj / Vk 


Ox f OOO 


13- 


-23 


dlv.b 


Vi,Vj,Vk 


Ox f 200 


13- 


-23 


div.h 


Vi,Vj y Vk 


Ox f 400 


13- 


-23 


div.w 


Vi,Vj,Vk 


Ox f 600 


13- 


-23 


div.l 


Vi,Vj,Vk 


Ox f 800 


13- 


-29 


div.b 


Vi,Sj,Vk 


OxfaOO 


13- 


-29 


div.h 


Vi,Sj,Vk 


OxfcOO 


13- 


-29 


div.w 


Vi,Sj,Vk 


OxfeOO 


13- 


-29 


div.l 


Vi.Sj.Vk 



Add vector/scalar integer word 
Add vector/scalar integer longword 
Subtract vector/vector integer byte 
Subtract vector/vector integer halfword 
Subtract vector/vector integer word 
Subtract vector/vector integer longword 
Subtract vector/scalar integer byte 
Subtract vector/scalar integer halfword 
Subtract vector/scalar integer word 
Subtract vector/scalar integer longword 
Multiply vector/vector integer byte 
Multiply vector/vector integer halfword 
Multiply vector/vector integer word 
Multiply vector/vector integer longword 
Multiply vector/scalar integer byte 
Multiply vector/scalar integer halfword 
Multiply vector/scalar integer word 
Multiply vector/scalar integer longword 
Divide vector/vector integer byte 
Divide vector/vector integer halfword 
Divide vector/vector integer word 
Divide vector/vector integer longword 
Divide vector/scalar integer byte 
Divide vector/scalar integer halfword 
Divide vector/scalar integer word 
Divide vector/scalar integer longword 
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APPENDIX C 
OP CODES SORTED BY NAME 



0x5900 


10-8 


add.b 


Sj,Sk 


Oxc800 


13-26 


add.b 


Vi,Sj,Vk 


OxcOOO 


13-20 


add.b 


Vi,Vj,Vk 


0x5540 


10-8 


add . d 


Sj,Sk 


OxbaOO 


13-26 


add.d 


Vi,Sj,Vk 


Oxb200 


13-20 


add.d 


Vi,Vj,Vk 


0x1400 


9-12 


add.h 


#N,Ak 


0x1408 


10-17 


add.h 


#N,Sk 


0x5880 


9-12 


add.h 


#n,Ak 


0x5800 


9-7 


add.h 


Aj,Ak 


0x5940 


10-8 


add.h 


Sj,Sk 


OxcaOO 


13-26 


add.h 


Vi Sj,Vk 


Oxc200 


13-20 


add.h 


Vi,Vj,Vk 


0x59c0 


10-8 


add.l 


Sj,Sk 


OxceOO 


13-26 


add.l 


Vi,Sj,Vk 


Oxc600 


13-20 


, add.l 


Vi,Vj,Vk 


0x1808 


10-17 


add.s 


#N,Sk 


0x5500 


10-8 


add.s 


Sj,Sk 


Oxb800 


13-26 


add.s 


Vi,Sj,Vk 


OxbOOO 


13-20 


add.s 


Vi,Vj,Vk 


0x1480 


9-12 


add.w 


#N,Ak 


0x1488 


10-17 


add.w 


#N,Sk 


0x58c0 


9-12 


add.w 


#n,Ak 


0x5840 


9-7 


add.w 


Aj,Ak 


0x5000 


9-16 


add.w 


Sj,Ak 


0x5980 


10-8 


add.w 


Sj,Sk 


OxceOO 


13-26 


add.w 


Vl,Sj,Vk 


0xc400 


13-20 


add.w 


Vi,Vj,Vk 


0x7e20 


15-8 


all Vk 


0x1200 


9-21 


and #N,Ak 


0x1208 


10-21 


and #N,Sk 


0x5200 


9-17 


and A j , Ak 


0x5300 


10-13 


and Sj,Sk 


0xa800 


13-35 


and Vi,Sj,Vk 


OxaOOO 


13-31 


and Vi,Vj,Vk 


Ox7e28 


15-9 


any Vk 


0x7d50 


11-9 


bkpt 




0x7100 


11-5 


br 




0x7400 


11-5 


bra. f 




0x7500 


11-5 


bra.t 




0x7200 


11-5 


bri.f 




0x7300 


11-5 


bri.t 




0x7600 


11-5 


brs. f 




0x7700 


11-5 


brs.t 




0x2000 


11-10 


call - 


<effa> 


0x2200 


11-14 


callq 


<effa> 


0x2100 


11-11 


calls 


<effa> 


0x6380 


14-8 


cprs . : 


f Vj,Vk 


Ox63cO 


14-8 


cprs . ' 


t Vj,Vk 


0x4080 


9-44 


cvtb .w Aj, Ak 


0x4180 


10-34 


cvtb . i 


* Sj,Sk 
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Add scalar/scalar integer byte 

Add vector/scalar integer byte 

Add vector/vector integer byte 

Add scalar/scalar double float 

Add vector /scalar double float 

Add vector /vector double float 

Add imm. address half word 

Add scalar/immed. integer half word 

Add short imm. address halfword 

Add addr. reg. halfword 

Add scalar/scalar integer halfword 

Add vector/scalar integer halfword 

Add vector/vector integer hal f word 

Add scalar/scalar integer longword 

Add vector/scalar integer longword 

Add vector/vector integer longword 

Add scalar/immed. single float 

Add scalar /scalar single float 

Add vector/scalar single float 

Add vector/vector single float 

Add imm. address word 

Add scalar/immed. integer word 

Add short imm. address word 

Add addr. reg. word 

Add scalar to addr word 

Add scalar/scalar integer word 

Add vector /scalar integer word 

Add vector/vector integer word 

AND reduce a vector 

AND imm. to addr. reg. 

AND scalar/ immediate 

AND addr . reg . 

AND scalar/scalar 

AND vector/scalar 

AND two vectors 

OR reduce a vector 

Breakpoint 

Branch Always 

Branch on address carry false 

Branch on address carry true 

Branch on ION false 

Branch on ION true 

Branch on scalar carry false 

Branch on scalar carry true 

Call a subroutine, make a long frame 

Push the program counter and jump 

Call a subroutine, make a short frame 

Compress a vector using not VM 

Compress a vector using VM 

Convert byte to word 

Convert byte to word 
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0x4340 10-34 cvtd.l S j , Sk 
0x4280 10-34 cvtd.s S j , Sk 
0x40c0 9-44 cvth.w Aj,AK 
0x41c0 10-34 cvth.w S j , Sk 

0x43c0 10-34 cvtl.d S j , Sk 

0x4380 10-34 cvtl.s S j , Sk 

0x4500 10-34 cvtl.w Sj,Sk 

0x42c0 10-34 cvts.d S j , Sk 

0x4300 10-34 cvts.l S j , Sk 

0x4240 10-34 cvts.w S j , Sk 

0x4000 9-44 cvtw.b Aj,Ak 

0x4100 10-34 cvtw.b S j , Sk 

0x4040 9-44 cvtw.h Aj,Ak 

0x4140 10-34 cvtw.h S j , Sk 

0x4540 10- 34 cvtw .1 S j , Sk 

0x4200 10-34 cvtw.s S j , Sk 

Ox7dcO 12-21 diag Ak 

0x5 f 00 10-11 div.b Sj,Sk 

0xf800 13-29 div.b Vi,Sj,Vk 

OxfOOO 13-23 div.b Vi,Vj,Vk 

0x57c0 10-11 div.d Sj,Sk 

0x9e00 13-29 div.d Vi,Sj,Vk 

0x9600 13-23 div.d Vi,Vj,Vk 

0x1700 9-15 div.h #N,Ak 

0x1708 10-20 div.h #N,Sk 

0x5e80 9-15 div.h #n,Ak 

0x5e00 9-10 div.h Aj.Ak 

0x5f40 10-11 div.h Sj,Sk 

OxfaOO 13-29 div.h Vi,Sj,Vk 

Oxf200 13-23 div.h Vi,Vj,Vk 

0x5 fcO 10-11 div.l Sj,Sk 

OxfeOO 13-29 div.l Vi,Sj,Vk 

0xf600 13-23 div.l Vi,Vj,Vk 

0x1988 10-20 div.s #N,Sk 

0x5780 10-11 div.s Sj,Sk 

0x9c00 13-29 div.s Vi,Sj,Vk 

0x9400 13-23 div.s Vi,Vj,Vk 

0x1780 9-15 div.w #N.Ak 

0x1788 10-20 div.w #N, Sk 

OxSecO 9-15 div.w #n,Ak 

0x5e40 9-10 div.w Aj.Ak 

0x5 f 80 10-11 div.w Sj.Sk 

OxfeOO 13-29 div.w Vi,Sj,Vk 

Oxf400 13-23 div.w Vi,Vj,Vk 

0x7d48 12-12 dsi 

0x7d40 12-12 eni 

0x4700 10-27 eq.b S j , Sk 

0x6900 14-6 eq.b Sj,Vk 

0x6800 14-4 eq.b Vj,Vk 

0x5640 10-28 eq.d S j /Sk 

0x6540 14-7 eq.d Sj,Vk 

0x6440 14-5 eq.d Vj.Vk 

OxlbOO 9-41 eq.h #N,Ak 

0xlb08 10-31 eq.h #N,Sk 

Appendix C 



Convert double float to longword 

Convert double float to single float 

Convert half to word 

Convert half word to word 

Convert longword to double float 

Convert longword to single float 

Convert longword to word 

Convert single float to double float 

Convert single float to longword 

Convert single float to word 

Convert word to byte 

Convert word to byte 

Convert word to half word 

Convert word to halfword 

Convert word to longword 

Convert word to single float 

Execute non-standard microcode sequence 

Divide scalar/scalar integer byte 

Divide vector /scalar integer byte 

Divide vector/vector integer byte 

Divide scalar/scalar double float 

Divide vector/scalar double float 

Divide vector /vector double float 

Divide imm. address halfword 

Divide scalar /scalar integer halfword 

Divide short imm. address halfword 

Divide addr. reg. halfword 

Divide scalar/scalar integer halfword 

Divide vector/scalar integer halfword 

Divide vector/vector integer halfword 

Divide scalar/scalar integer longword 

Divide vector/scalar integer longword 

Divide vector/vector integer longword 

Divide scalar/scalar single float 

Divide scalar/scalar single float 

Divide vector/scalar single float 

Divide vector/vector single float 

Divide imm. address word 

Divide scalar/scalar integer word 

Divide short imm. address word 

Divide addr. reg. word 

Divide scalar/scalar integer word 

Divide vector/scalar integer word 

Divide vector/vector integer word 

Disable interrupts , reset ion to 

Enable interrupts, set ion to 1 

Compare equal byte 

Compare equal byte 

Compare equal byte 

Compare equal double float 

Compare equal double precision 

Compare equal double precision 

Compare equal halfword 

Compare equal halfword 
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0x4680 


9-40 


eq.h #n,Ak 


0x4600 


9-37 


eq.h Aj , Ak 


0x4740 


10-27 


eq.h Sj,Sk 


0x6940 


14-6 


eq.h Sj,Vk 


0x6840 


14-4 


eq.h Vj,Vk 


Ox47cO 


10-27 


eq.l Sj,Sk 


0x69c0 


14-6 


eq.l Sj,Vk 


0x68c0 


14-4 


eq.l Vj,Vk 


0x5600 


10-27 


eq.s Sj,Sk 


0x6500 


14-6 


eq.s Sj,Vk 


0x6400 


14-4 


eq.s Vj,Vk 


0xlb80 


9-41 


eq.w #N,Ak 


0xlb88 


10-31 


eq.w #N,Sk 


0x46c0 


9-40 


eq.w #n,Ak 


0x4640 


9-37 


eq.w Aj,Ak 


0x4780 


10-27 


eq.w Sj,Sk 


0x6980 


14-6 


eq.w Sj,Vk 


0x6880 


14-4 


eq.w Vj,Vk 


0x0000 


11-7 


exit 


0x1000 


12-20 


halt #N,Ak 


0x0100 


11-7 


jmp <ef fa> 


Ox0400 


11-7 


jmpa.f <effa> 


0x0500 


11-7 


jmpa.t <effa> 


0x0200 


11-7 


jmpi.f <effa> 


0x0300 


11-7 


jmpi.t <effa> 


0x0600 


11-7 


jmps.f <effa> 


0x0700 


11-7 


jmps.t <effa> 


0x2800 


9-4 


ld.b <effa>,Ak 


0x3000 


10-4 


ld.b <effa>,Sk 


0x3800 


13-8 


ld.b <effa>,Vk 


0x1008 


10-6 


ld.d #N,Sk 


0x3300 


10-4 


ld.d <effa>,Sk 


0x3b00 


13-8 


ld.d <effa>,Vk 


0x1188 


10-6 


ld.dl #N,Sk 


0x1088 


10-6 


ld.du #N,Sk 


0x1100 


9-6 


ld.h #N,Ak 


0x4480 


9-6 


ld.h #n,Ak 


0x2900 


9-4 


ld.h <effa>,Ak 


0x3100 


10-4 


ld.h <effa>,Sk 


0x3900 


13-8 


ld.h <effa>,Vk 


0x1108 


10-6 


ld.l #N,Sk 


0x3300 


10-4 


ld.l <effa>,Sk 


OxOaOO 


16-6 


ld.l <effa>,VLS 


Ox3bOO 


13-8 


ld.l <effa>,Vk 


0x1188 


10-6 


Id. 11 #N,Sk 


0x1088 


10-6 


ld.lu #N,Sk 


0x3200 


10-4 


ld.s <effa>,Sk 


Ox3a00 


13-8 


ld.s <effa>,Vk 


0x1088 


10-6 


ld.u #N,Sk 


0x1180 


9-6 


ld.w #N,Ak 


0x1188 


10-6 


ld.w #N,Sk 


0x1800 


16-3 


ld.w #N,VL 


0x1880 


16-5 


ld.w #N,VS 


0x44c0 


9-6 


ld.w #n,Ak 
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Compare equal halfword 

Compare equal halfword 

Compare equal halfword 

Compare equal halfword 

Compare equal halfword 

Compare equal longword 

Compare equal longword 

Compare equal longword 

Compare equal single float 

Compare equal single 

Compare equal single 

Compare equal word 

Compare equal word 

Compare equal word 

Compare equal word 

Compare equal word 

Compare equal word 

Compare equal word 

Error Exit Instruction 

Halt the central processing unit 

Jump Always 

Jump on address carry false 

Jump on address carry true 

Jump on ION false 

Jump on ION true 

Jump on scalar carry false 

Jump on scalar carry true 

Load addr. reg. byte 

Load scalar byte 

Load vector byte 

Load immediate, most significant bits 

Load scalar double float 

Load vector double float 

Load 64 bit floating immed., lower half 

Load 64 bit floating immed., upper half 

Load halfword imm. into Ak 

Load short imm. into Ak 

Load addr. reg. halfword 

Load scalar halfword 

Load vector halfword 

Load 64 bit sign extended immediate 

Load scalar longword 

Load VS and VL from memory 

Load vector longword 

Load 64 bit integer immed., lower half 

Load 64 bit integer immed. , upper half 

Load scalar single float 

Load vector single float 

Load immediate, upper half 

Load imm. into Ak 

Load a 32 bit immediate 

Load VL with an immediate 

Load VS from an immediate 

Load short imm. into Ak 
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Ox2aOO 

0x3200 

Ox3aOO 

OxObOO 

0x0900 

0x7c08 

0x4400 

Ox7c00 

0x7800 

Ox78cO 

0x7840 

0x78c0 

0x7880 

0x7880 

Ox4d00 

0x6b00 

Ox6aOO 

0x5440 

0x6740 

0x6640 

OxleOO 

OxleOS 

0x4c80 

0x4c00 

0x4d40 

0x6b40 

0x6a40 

Ox4dcO 

Ox6bc0 

0x6ac0 

Oxla08 

0x5400 

0x6700 

0x6600 

0xle80 

Oxle88 

0x4cc0 

0x4c40 

0x4d80 

0x6b80 

Ox6a80 

0x4900 

OxleOO 

0xlc08 

0x4880 

0x4800 

0x4940 

0x49c0 

Oxlc80 

0xlc88 

0x48c0 

0x4840 

0x4980 

0x4f00 



9-4 

10-4 

13-8 

16-10 

9-26 

12-6 

9-33 

12-5 

13-12 

13-12 

13-12 

13-12 

13-12 

13-12 

10-27 

14-6 

14-4 

10-27 

14-6 

14-4 

9-41 

10-31 

9-40 

9-37 

10-27 

14-6 

14-4 

10-27 

14-6 

14-4 

10-31 

10-27 

14-6 

14-4 

9-41 

10-31 

9-40 

9-37 

10-27 

14-6 

14-4 

10-29 

9-42 

10-33 

9-42 

9-39 

10-29 

10-29 

9-42 

10-33 

9-42 

9-39 

10-29 

10-27 



ld.w <effa>,Ak 
ld.w <effa> / Sk 
ld.w <effa>,Vk 
ld.x <effa>, VM 
Idea <effa>,Ak 
ldkdr Ak 
ldpa Aj,Ak 
ldsdr Ak 
ldvi.b Vj,Vk 
ldvi.d Vj,Vk 
ldvi.h Vj,Vk 
ldvl.l Vj,Vk 
ldvi.s Vj,Vk 
ldvi.w Vj,Vk 
le.b Sj,Sk 
le.b Sj,Vk 
le.b Vj,Vk 
le.d Sj,Sk 
le.d Sj.Vk 
le.d Vj,Vk 
le.h #N,Ak 
le.h #N,Sk 
le.h #n,Ak 
le.h Aj,Ak 
le.h Sj,Sk 
le.h Sj,Vk 

le.h Vj,Vk 
.1 Sj,Sk 
.1 Sj.Vk 

le.l Vj,Vk 

le.s #N.Sk 

le.s Sj,Sk 

le.s Sj,Vk 

le.s Vj,Vk 

le.w #N,Ak 

le.w #N,Sk 

le.w #n,Ak 

le.w Aj,AK 

le.w Sj,Sk 

le.w Sj,Vk 

le.w Vj,Vk 

leu . b S j , Sk 

leu.h #N,Ak 

leu.h #N,Sk 

leu.h #n,Ak 

leu.h Aj ,Ak 

leu.h Sj,Sk 

leu.l Sj,Sk 

leu.w #N,Ak 

leu.w #N,Sk 

leu.w #n,Ak 

leu.w Aj,Ak 

leu.w Sj.Sk 

lt.b Sj,Sk 



le. 
le. 



Load addr. reg. word 

Load scalar word 

Load vector word 

Load VM from memory 

Load effective address 

Load all 8 SDR's 

Load a physical byte address into Ak 

Load process SDR's 

Index Load vector byte 

Index Load vector double float 

Index Load vector halfword 

Index Load vector longword 

Index Load vector single float 

Index Load vector word 

Compare less than or equal byte 

Compare less than or equal byte 

Compare less than or equal byte 

Compare less than or equal double float 

Compare less than or equal double float 

Compare less than or equal double float 

Compare less than or equal halfword 

Compare less than or equal halfword 

Compare less than or equal halfword 

Compare less than or equal signed halfword 

Compare less than or equal halfword 

Compare less than or equal halfword 

Compare less than or equal halfword 

Compare less than or equal longword 

Compare less than or equal longword 

Compare less than or equal longword 

Compare less than or equal single 

Compare less than or equal single float 

Compare less than or equal single 

Compare less than or equal single 

Compare less than or equal word 

Compare less than or equal word 

Compare less than or equal word 

Compare less than or equal signed word 

Compare less than or equal word 

Compare less than or equal word 

Compare less than or equal word 

Compare less than or equal byte 

Compare unsigned less than halfword 

Compare unsigned less than or equal halfword 

Compare unsigned less than or equal halfword 

Compare unsigned less than or equal halfword 

Compare less than or equal halfword 

Compare less than or equal longword 

Compare unsigned less than word 

Compare unsigned less than or equal word 

Compare unsigned less than or equal word 

Compare unsigned less or equal than word 

Compare less than or equal word 

Compare less than byte 
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0x6d00 14-6 lt.b Sj,Vk 

Ox6cOO 14-4 lt.b Vj,Vk 

0x54c0 10-27 lt.d S j , Sk 

0x67c0 14-6 lt.d Sj,Vk 

Ox66cO 14-4 lt.d Vj,Vk 

OxlfOO 9-41 lt.h #N,Ak 

0xlf08 10-31 lt.h #N,Sk 

0x4e80 9-40 lt.h #n,Ak 

0x4e00 9-37 lt.h Aj,Ak 

0x4f40 10-27 lt.h S j , Sk 

0x6d40 14-6 lt.h Sj,Vk 

0x6c40 14-4 lt.h Vj,Vk 

0x4fc0 10-27 lt.l Sj,Sk 

Ox6dcO 14-6 lt.l Sj,Vk 

0x6cc0 14-4 lt.l Vj,Vk 

0xla88 10-31 lt.s #N,Sk 

0x5480 10-27 lt.s S j , Sk 

0x6780 14-6 lt.s Sj,Vk 

0x6680 14-4 lt.s Vj,Vk 

Oxl f 80 9-41 lt.w #N,Ak 

Oxlf88 10-31 lt.w #N,Sk 

0x4ec0 9-40 lt.w #n,Ak 

0x4e40 9-37 lt.w Aj,Ak 

0x4f80 10-27 lt.w Sj,Sk 

0x6d80 14-6 lt.w Sj,Vk 

0x6c80 14-4 lt.w Vj.Vk 

0x4b00 10-29 ltu.b S j , Sk 

OxldOO 9-42 ltu.h #N,Ak 

0xld08 10-33 ltu.h #N,Sk 

0x4a80 9-42 ltu.h #n,Ak 

0x4b40 10-29 ltu.h S j , Sk 

0x4a00 9-39 ltu.h gt,Aj,Ak 

0x4bc0 10-29 ltu.l S j , Sk 

Oxld80 9-43 ltu.w #N,Ak 

0xld88 10-33 ltu.w #N,Sk 

0x4ac0 9-42 ltu.w #n,Ak 

0x4b80 10-29 ltu.w S j , Sk 

0x4a40 9-39 ltu.w gt,Aj,Ak 

0x8a00 14-14 mask.f Vi,Sj,Vk 

0x8e00 14-14 mask.t Vi,Sj,Vk 

0x8600 14-13 mask.t Vi,Vj,Vk 

0x7e40 15-6 max.b Vk 

0x7ea8 15-6 max.d Vk 

0x7e48 15-6 max.h Vk 

0x7e58 15-6 max.l Vk 

Ox7ea0 15-6 max.s Vk 

0x7e50 15-6 max.w Vk 

0x8800 14-12 merg.f Vi,Sj,Vk 

0x8c00 14-12 merg.t Vi,Sj,Vk 

0x8400 14-10 merg.t Vi,Vj,Vk 

0x7e60 15-7 min.b Vk 

0x7eb8 15-7 min.d Vk 

0x7e68 15-7 min.h Vk 

0x7e78 15-7 min.l Vk 



Compare 

Compare 

Compare 

Compare 

Compare 

Compare 

Compare 

Compare 

Compare 

Compare 

Compare 

Compare 

Compare less than 

Compare less than 

Compare 

Compare 

Compare 

Compare 

Compare 

Compare 

Compare 

Compare 

Compare 

Compare 

Compare 

Compare 

Compare 



less than byte 

less than byte 

less than double float 

less than double float 

less than double float 

less than half word 

less than half word 

less than half word 

less than signed halfword 

less than halfword 

less than halfword 

less than halfword 

longword 

longword 
less than longword 
less than single 



less than single float 

less than single 

less than single 

less than word 

less than word 

less than word 

less than signed word 

less than word 

less than word 

less than word 

less than byte 
Compare unsigned less than halfword 
Compare unsigned less than halfword 
Compare unsigned less than halfword 
Compare less than halfword 
Compare unsigned less than halfword 
Compare less than longword 
Compare unsigned less than word 
Compare unsigned less than word 
Compare unsigned less than word 
Compare less than word 
Compare unsigned less than word 
Mask vector/scalar using not VM 
Mask vector/scalar using VM 
Mask vector /vector 
Max of a vector of bytes 

vector of double float 
vector of ha If words 
vector of longwords 
vector of single float 
vector of words 
Merge vector/scalar using not VM 
Merge vector/scalar 
Merge vector/vector 
Min of a vector of bytes 
Min of a vector of double float 
Min of a vector of ha If words 
Min of a vector of longwords 



Max 


of 


a 


Max 


of 


a 


Max 


of 


a 


Max 


of 


a 


Max 


of 


a 
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Ox7ebO 

0x7e70 

0x5080 

0x51c0 

0x7d98 

0x7d88 

0x7c48 

0x7c60 

0x7c50 

0x7c40 

0x8200 

0x50c0 

0x6100 

0x6140 

0x7c68 

0x7E70 

0x7c78 

0x7d90 

0x7d80 

0x8000 

0x5180 

0x5180 

0x5100 

0x5100 

0x7cb8 

0x7ca8 

0x7d60 

.0x5d00 

0xe800 

OxeOOO 

0x5740 

0x9a00 

0x9200 

0x1600 

0x1608 

0x5c80 

0x5c00 

0x5d40 

OxeaOO 

0xe200 

0x5dc0 

OxeeOO 

Oxe600 

0x1908 

0x5700 

0x9800 

0x9000 

0x1680 

0x1688 

0x5cc0 

0x5c40 

0x5d80 

OxeeOO 

0xe400 



15-7 

15-7 

9-29 

10-40 

16-2 

16-4 

9-36 

12-16 

9-28 

9-35 

13-39 

10-39 

16-13 

16-13 

12-15 

12-23 

12-17 

16-2 

16-4 

13-40 

10-41 

10-41 

10-41 

10-41 

16-8 

16-9 

12-19 

10-10 

13-28 

13-22 

10-10 

13-28 

13-22 

9-14 

10-19 

9-14 

9-9 

10-10 

13-28 

13-22 

10-10 

13-28 

13-22 

10-19 

10-10 

13-28 

13-22 

9-14 

10-19 

9-14 

9-9 

10-10 

13-28 

13-22 



min.s Vk 
min.w Vk 
mov Aj,Ak 
mov Aj,Sk 
mov Ak,VL 
mov Ak,VS 
mov Ak,psw 
mov ITR, Sk 
mov PC,Ak 
mov PSW,Ak 
mov Si,Sj,Vk 
mov Sj,Ak 
mov Sj,Sk,VM 
mov Sj,VM, Sk 
mov Sk, ITR 
mov Sk,W 
mov Sk, itsr 
mov VL,Ak 
mov VS,Ak 
mov Vl,Sj,Sk 
mov.d Sj,Sk 
mov .1 S j , Sk 
mov.s Sj,Sk 
mov.w Sj,Sk 
mov . w Sk , VL 
mov.w Sk,VS 
mskl Sk 
mul.-b Sj,Sk 
mul.b Vi,Sj,Vk 
mul.b Vi,Vj,Vk 
mul . d S j , Sk 
mul.d Vi,Sj,Vk 
mul.d Vi,Vj,Vk 
mul.h #N,Ak 
mul.h #N,Sk 
mul.h #n,Ak 
mul .h Aj,Ak 
mul.h Sj,Sk 
mul.h Vi,Sj,Vk 
mul.h Vi,Vj,Vk 
mul.l Sj,Sk 
mul.l Vi,Sj,Vk 
mul.l Vi,Vj,Vk 
mul.s #N,Sk 
mul . s S j , Sk 
mul.s Vi,Sj,Vk 
mul.s Vi,Vj,Vk 
mul.w #N,Ak 
mul.w #N,Sk 
mul.w #n,Ak 
mul.w Aj,AK 
mul.w Sj,Sk 
mul.w Vi,Sj,Vk 
mul.w Vi,Vj,Vk 



Min of a vector of single float 

Min of a vector of words 

Move addr. reg. 

Move an address to a scalar 

Move Ak to VL 

Move Ak to VS 

Load an addr. reg. into the PSW 

Move the itc, itsr,nitc into Sk 

Load next PC address 

Store the PSW into an addr. reg. 

Move a scalar to a vector element 

Move 32 bits of Sj into Ak. 

Load VM(Sj) from Sk. 

Load Sk from VM 

Load NITC, ITC, ITSR from Sk 

Move scalar to vector valid flag 

Load ITSR with a scalar 

Move VL to Ak 

Move VS to Ak 

Move a vector element to a scalar 

Move scalar register single float 

Move scalar register longword 

Move scalar register double float 

Move scalar register word 

Move Sk to VL 

Move Sk to VS 
Mask Out Interrupt 

Multiply scalar /scalar integer byte 
Multiply vector /scalar integer byte 
Multiply vector/vector integer byte 
Multiply scalar/scalar double float 
Multiply vector/scalar double float 
Multiply vector/vector double float 
Multiply imm. address halfword 
Multiply scalar/immed. integer halfword 
Multiply short imm. address halfword 
Multiply addr. reg. halfword 
Multiply scalar /scalar integer halfword 
Multiply vector/scalar integer halfword 
Multiply vector/vector integer halfword 
Multiply scalar/scalar integer longword 
Multiply vector/scalar integer longword 
Multiply vector/vector integer longword 
Multiply scalar/immed. single float 
Multiply scalar/scalar single float 
Multiply vector/scalar single float 
Multiply vector/vector single float 
Multiply imm. address word 
Multiply scalar/immed. integer word 
Multiply short imm. address word 
Multiply addr. reg. word 
Multiply scalar/scalar integer word 
Multiply vector/scalar integer word 
Multiply vector/vector integer word 
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Ox6fOO 10-12 neg.b S j , Sk 

0x6e00 13-25 neg.b Vj,Vk 

0x65c0 10-12 neg.d S j , Sk 

0x64c0 13-25 neg.d Vj,Vk 

0x5680 9-11 neg.h Aj,Ak 

0x6f40 10-12 neg.h S j , Sk 

0x6e40 13-25 neg.h Vj,Vk 

0x6fc0 10-12 neg.l S j , Sk 

0x6ec0 13-25 neg.l Vj,Vk 

0x6580 10-12 neg.s S j , Sk 

0x6480 13-25 neg.s Vj,Vk 

0x56c0 9-11 neg.w Aj,Ak 

0x6 f 80 10-12 neg.w Sj,Sk 

0x6e80 13-25 neg.w Vj,Vk 

0x7000 11-5 nop 

0x52c0 9-20 not Aj,Ak 

0x53c0 10-16 not S j , Sk 

0x62c0 13-34 not Vj,Vk 

0x1280 9-22 or #N,Ak 

0x1288 10-22 or #N.Sk 

0x5240 9-18 or Aj,Ak 

0x5340 10-14 or S j , Sk 

OxaaOO 13-36 or Vi,Sj,Vk 

0xa200 13-32 or Vi,Vj,Vk 

0x7e30 15-10 parity Vk 

0x7c28 12-9 pate Ak 

0x7c20 12-8 patu 

0x7c30 12-13 pich 

0x7ee0 16-12 plc.f VM,Sk 

0x4580 10-38 plc.t S j , Sk 

0x7ee8 16-12 plc.t VM,Sk 

0x6340 15-11 plc.t Vj,Vk 

0x7c38 12-14 plch 

0x7d38 10-26 pop.l Sk 

Ox7dlO 9-31 pop.w Ak 

0x7d30 10-26 pop.w Sk 

0x7ec0 15-4 prod.b Vk 

0x7e98 15-4 prod.d Vk 

0x7ec8 15-4 prod.h Vk 

0x7ed8 15-4 prod.l Vk 

0x7e90 15-4 prod.s Vk 

0x7ed0 15-4 prod.w Vk 

0x7d28 10-25 psh.l Sk 

Ox7dOO 9-30 psh.w Ak 

0x7d20 10-25 psh.w Sk 

OxOdOO 9-27 pshea <effa> 

0x7c90 11-12 rtn 

0x7ca8 12-10 rtnc 

0x7c80 11-15 rtnq 

0x1380 9-25 shf #N,AK 

0x1388 10-24 shf #N,Sk 

0x4440 9-25 shf #n,Ak 

0x5040 9-24 shf Aj,Ak 

0x5140 10-36 shf S j , Sk 



Negate scalar/scalar integer byte 

Negate vector/vector integer byte 

Negate scalar/scalar double float 

Negate vector/vsctor double float 

Negate addr . reg. half word 

Negate scalar/scalar integer halfword 

Negate vector/vector integer halfword 

Negate scalar/scalar integer longword 

Negate vector/ vector integer longword 

Negate scalar/scalar single float 

Negate vector/vector single float 

Negate addr. reg. word 

Negate scalar/scalar integer word 

Negate vector/vector integer word 

No Operation 

Complement addr. reg. 

Complement scalar /scalar 

Complement a vector 

OR imm. to addr. reg. 

OR scalar/immediate 

OR addr. reg. 

OR scalar/scalar 

OR vector/scalar 

OR two vectors 

Exclusive OR reduce a vector 

Purge ATU entry 

Purge the entire ATU 

Purge the instruction cache 

Load the number of O's in VM into Sk 

Count the number of 1 ' s in S j 

Load the number of l's in VM into Sk 

Population Count of a Vector 

Purge the logical cache 

Pop Sk<63..0> from the stack. 

Pop word into addr. reg. 

Pop Sk<31..0> from the stack 

Multiply reduce a vector of bytes 

a vector of double float 
a vector of ha If words 
a vector of longwords 
a vector of single float 



Multiply reduce 
Multiply reduce 
Multiply reduce 
Multiply reduce 



Multiply reduce a vector of words 

Push Sk<63..0> onto the stack. 

Push an addr. reg. 

Push Sk<31..0> onto the stack 

Push effective address 

Return from subroutine call 

Return from a context block 

Pop the program counter and jump 

Logical shift imm. to addr. reg. 

Shift Scalar/immediate 

Logical shift left short imm. 

Shift an address 

Shift a scalar 



Appendix C 



Page C-7 



0x6300 13-38 shf Sj,Vk 

0x2c00 9-5 st.b Ak,<effa> 

0x3400 10-5 st.b Sk,<effa> 

0x3c0O 13-10 st.b Vk,<effa> 

0x3700 10-5 st.d Sk, <ef fa> 

0x3f00 13-10 st.d Vk,<effa> 

0x2d00 9-5 st.h Ak,<effa> 

0x3500 10-5 st.h Sk,<effa> 

0x3d00 13-10 st.h Vk,<effa> 

0x370O 10-5 st.l Sk,<effa> 

OxOeOO 16-7 st.l VLS,<effa> 

0x3 fOO 13-10 st.l Vk,<effa> 

0x3600 10-5 st.s Sk,<effa> 

Ox3eOO 13-10 st.s Vk,<effa> 

Ox2e00 9-5 st.w Ak,<effa> 

0x3600 10-5 st.w Sk,<effa> 

0x3e00 13-10 st.w Vk,<effa> 

OxOfOO 16-11 st.x VM, <effa> 

0x2400 13-18 ste.b Sk, <ef fa> 

0x2700 13-18 ste.d Sk,<effa> 

0x2500 13-18 ste.h Sk,<effa> 

Ox270O 13-18 ste.l Sk,<effa> 

0x2600 13-18 ste.s Sk, <ef fa> 

0x2600 13-18 ste.w Sk, <ef fa> 

0x7b00 13-16 stvi.b Sk,Vj 

0x7a00 13-14 stvi.b Vk,Vj 

0x7bc0 13-16 stvi.d Sk,Vj 

0x7ac0 13-14 stvi.d Vk,Vj 

0x7b40 13-16 stvi.h Sk,Vj 

0x7a40 13-14 stvi.h Vk,Vj 

0x7bc0 13-16 stvi.l Sk,Vj 

0x7ac0 13-14 stvi.l Vk,Vj 

0x7b80 13-16 stvi.s Sk,Vj 

0x7a80 13-14 stvi.s Vk,Vj 

0x7b80 13-16 stvi.w Sk,Vj 

0x7a80 13-14 stvi.w Vk,Vj 

Ox5b00 10-9 sub.b Sj,Sk 

Oxd800 13-27 sub.b Vi,Sj,Vk 

OxdOOO 13-21 sub.b Vi,Vj,Vk 

0x55c0 10-9 sub.d S j , Sk 

OxbeOO 13-27 sub.d Vi,Sj,Vk 

Oxb600 13-21 sub.d Vi,Vj,Vk 

0x1500 9-13 sub.h #N,Ak 

0x1508 10-18 sub.h #N, Sk 

0x5a80 9-13 sub.h #n,Ak 

0x5a00 9-8 sub.h Aj,Ak 

0x5b40 10-9 sub.h S j , Sk 

OxdaOO 13-27 sub.h Vi,Sj,Vk 

Oxd200 13-21 sub.h Vi,Vj,Vk 

0x5bc0 10-9 sub.l S j , Sk 

OxdeOO 13-27 sub.l Vi,Sj,Vk 

0xd6O0 13-21 sub.l Vi,Vj,Vk 

0x1888 10-18 sub.s #N,Sk 

0x5580 10-9 sub.s S j , Sk 
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Shift a vector accumulator 

Store addr. reg. byte 

Store scalar byte 

Store vector byte 

Store scalar double float 

Store vector double float 

Store addr. reg. ha If word 

Store scalar halfword 

Store vector halfword 

Store scalar longword 

Store VS and VL to memory 

Store vector longword 

Store scalar single float 

Store vector single float 

Store addr. reg. word 

Store scalar word 

Store vector word 

store VM into memory 

Store an extended scalar byte 

Store an extended scalar double float 

Store an extended scalar halfword 

Store an extended scalar longword 

Store an extended scalar single float 

Store an extended scalar word 

Scalar Index Store vector byte 

Index Store vector byte 

Scalar Index Store vector double float 

Index Store vector double float 

Scalar Index Store vector halfword 

Index Store vector halfword 

Scalar Index Store vector longword 

Index Store vector longword 

Scalar Index Store vector single float 

Index Store vector single float 

Scalar Index Store vector word 

Index Store vector word 

Subtract scalar /scalar integer byte 

Subtract vector/scalar integer byte 

Subtract vector/vector integer byte 

Subtract scalar/scalar double float 

Subtract vector/scalar double float 

Subtract vector /vector double float 

Subtract imm. address halfword 

Subtract scalar /immed. integer halfword 

Subtract short imm. address halfword 

Subtract addr. reg. halfword 

Subtract scalar /scalar integer halfword 

Subtract vector/scalar integer halfword 

Subtract vector/vector integer halfword 

Subtract scalar/scalar integer longword 

Subtract vector /scalar integer longword 

Subtract vector/vector integer longword 

Subtract scalar/immed. single float 

Subtract scalar/scalar single float 



Page C-8 



OxbcOO 


13-27 


sub.s 


Vi,Sj,Vk 


Oxb400 


13-21 


sub.s 


Vi , V j , Vk 


0x1580 


9-13 


sub.w 


#N,Ak 


0x1588 


10-18 


sub.w 


#N,Sk 


0x5ac0 


9-13 


sub.w 


#n,AK 


0x5a40 


9-8 


sub.w 


Aj,Ak 


0x5b80 


10-9 


sub.w 


Sj,Sk 


OxdcOO 


13-27 


sub.w 


Vi,Sj,Vk 


0xd4Q0 


13-21 


sub.w 


Vi,Vj,Vk 


0x7e00 


15-2 


sum.b 


Vk 


0x7e88 


15-2 


sum.d 


Vk 


0x7e08 


15-2 


sum.h 


Vk 


0x7el8 


15-2 


sum.l 


Vk 


0x7e80 


15-2 


SUB.S 


Vk 


0x7el0 


15-2 


sum.w 


Vk 


0x1080 


11-15 


sysc i 


*r,#g 


OxOcOO 


9-32 


tas <effa> 


0x7E78 


12-24 


tstvv 




0x45c0 


10-37 


tzc Sj,Sk 


0x7d68 


12-18 


xmti ! 


3k 


0x1300 


9-23 


xor #N,AK 


0x1308 


10-23 


xor #N,Sk 


0x5280 


9-19 


xor Aj , Ak 


0x5380 


10-15 


xor Sj,Sk 


OxacOO 


13-37 


xor Vi,Sj,Vk 


0xa400 


13-33 


xor Vi,Vj,Vk 



Subtract vector/scalar single float 

Subtract vector/vector single float 

Subtract imm. address word 

Subtract scalar/immed. integer word 

Subtract short imm. address word 

Subtract addr. reg. word 

Subtract scalar/scalar integer word 

Subtract vector/scalar integer word 

Subtract vector/vector integer word 

Sum a vector o f bytes 

Sum a vector of double float 

Sum a vector of half words 

Sum a vector of longwords 

Sum a vector of single float 

Sum a vector of words 

Perform a system call 

Test and Set a memory byte 

Test value of vector valid flag 

Count of trailing zeroes in Sj 

Transmit Interrupt 

Exclusive OR imm. to addr. reg. 

Exclusive OR scalar/immediate 

Exclusive OR addr. reg. 

Exclusive OR scalar /scalar 

Exclusive OR vector/scalar 

Exclusive OR two vectors 
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APPENDIX D 
FLOATING POINT ALGORITHMS 



OVERVIEW 



This appendix details the floating point algorithms used by the CONVEX 
instruction set. These algorithms involve rounding, sequencing of opera- 
tions, and other considerations. 

The CONVEX format for a double precision floating point operand in memory 



Exponent | 



Fraction 



63,62 

BYTE 



52,51 




BYTE 7 



The CONVEX format for a single precision floating point operand in memory 
is: 



S | Exponent | 

23,22 



Fraction 



31,30 
BYTE 




BYTE 3 



AJ2B/S 

The fraction of the floating point operands are expanded internally as fol- 
lows: 

1 A 1 is appended to the higher bit position of the fraction. 

2 Two guard bits are appended to the right of the least signifi- 
cant fraction bit. These bits are referred to as G and R in 
that order. 

3 A sticky bit is appended to the right of the two guard bits. The 
sticky or S bit is the OR of all bits to the right of the R bit. 

4 An additional bit is appended to the higher fraction, the V bit, 
for overflow. 

Thus the internal floating point format is: 
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The initial values of the V,G,R, and S bits are all 0. 

5 The exponents of the two fractions are compared. The fraction 
of the smaller exponent is shifted right an amount equal to the 
absolute difference of the exponents. All right shifted bits 
are shifted through the G, R, and S bits. 

6 Any binary l's shifted past the 2 guard bits are remembered in 
S. 

7 If any of the input operands were zero (Sign is 0, and Exponent 
is 0) no shifting occurs. 

8 If any of the input operands was reserved (Sign is 1, and 
Exponent is 0) , no shifting occurs, a reserved operand exception 
occurs, and the output of the ADD/SUB is a reserved operand. 

9 Otherwise, the two fractions are algebraically added/subtracted 
according to the sign and opcode. 

10 The result is normalized. If V became 1, the intermediate 
result is right shifted one bit position. R is ORed with S. If a 
generated subtract was performed, the intermediate result is 
left shifted until a normalized intermediate result is obtained. 
S need not participate in the left shifts; zero or S may be 
shifted into R from the right. G is loaded with R. S is always 
unchanged. 

11 The intermediate normalized result is rounded as follows: 



G R S Round Performed (to LSB) 



Add 

1 Add 

10 Add 

11 Add 

10 Add LSB of fraction - round to nearest even. 

10 1 Add 1 

110 Add 1 

111 Add 1 

12 The rounded intermediate result is normalized again and the exponent 
is adjusted if necessary to yield the final result. 



Multiplication of 2 normalized floating point numbers produces an 
intermediate result that is either normalized or at most requires one 
left shift. The steps for multiplication are as follows: 
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1 If either of the two operands is a reserved operand, the result is a 
reserved operand. 

2 If either of the two operands is zero (Sign is 0, and Exponent is 

) , then the result is a true zero (Sign, Exponent, and fraction of all 
0). 

3 Otherwise, add the exponents, keeping an extra bit of precision to 
account for a normalization shift that could correct an exponent 
overflow. 

4 Multiply the two fractions right to left. 

5 Maintain the G, R, and S bits during intermediate calculations. 

6 Post normalize if required. 

7 Round the intermediate result per step 11 in the description in ADD/SUB. 

8 The rounded intermediate result is normalized again and the exponent 
adjusted if necessary to yield the final result. 



DIVIDE 

Division of 2 normalized floating point numbers produces an intermediate 
result that is normalized. 



1 If either of the two operands is a reserved operand, the result 
is a reserved operand. 

2 If the divisor is zero, then the result is a reserved operand. 
Also the FDZ bit in the PSW is set to 1. 

3 Subtract the exponents producing the result exponent. 

4 Divide the numerator by the denominator mantissa until a normal- 
ized result is obtained. A (n+2) bit quotient is generated 
where n is the length of the mantissa of the operands. The two 
additional quotient bits represent the G and R bits. The state 
of the S bit is implementation specific. The S bit may always 
be assumed to be 0, or may represent the OR of some portion if 
not the entire remainder. 

5 Round the intermediate result per step 11 in the description of 
ADD/SUB. 

6 The rounded intermediate result is normalized again and the 
exponent adjusted if necessary to yield the final result. 
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Converting from float to fix: always round toward (truncate). This 
conversion obeys the FORTRAN rounding convention. 

Converting from fix to float: properly normalize the integer, then truncate 
to the appropriate mantissa size. 

Please note that rounding from float to fix can be achieved by first adding 
.5 to the floating point operand and then by executing the float to fix 
instruction. Thus, RND (3.4) equals TRUNCATE (3.9) = 3 and RND (3.5) 
equals TRUNCATE (4.0) = 4. 
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APPENDIX E 
ASSEMBLER NOTATION 



Introduction 

The CONVEX instruction set has eight possible operand addressing modes: 
register mode, immediate operands and six modes for specifying operands in 
memory. Since the CONVEX architecture is based on three sets of high speed 
registers, most of the instructions are limited to register operands only. 
Each addressing mode is described in detail in the following sections. 

Instruction E2rma£ 

An assembly language program is a sequence of instructions. These instruc- 
tions may be machine instructions that will be translated into machine 
language instructions, or they may be directives to the assembler. Both 
types of instructions follow the same basic format, as follows. 

The instruction is composed of five fields: label, mnemonic, operand list, 
comment, and the terminator. The only field that is required is the termi- 
nator field (exclamation point or newline) . The format for an instruction 
is shown below: 

[label:] mnemonic [operand list] [;comment] terminator 



Sample code: 
Arg = 4 



loop: 



eq.w 


#0,s0 


jmps. t 


exit 


add.w 


#l,sO 


ld.w 


arg(a2) , si 


add.w 


sl,sO 


br 


loop 



exit: 



sing Mcdss 

Register Mods 

The majority of the instructions in the instruction set requires one or 
more register mode operands. A register mode operand specifies the regis- 
ter set and the particular register in the set to be used as the operand. 
The actual machine instruction generated by the assembler will depend on 
both the register set used and the register within that register set. 
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General register operands are specified by a letter and a number. The 
letter denotes the register set, and the number specifies the register 
number in the set. The general register sets for the machine are the 
address registers (A) , the scalar registers (S) , and the vector registers 
(V) . The register set letter can be specified in either upper case or lower 
case. Each register set contains eight registers denoted by the numbers 
zero to seven. 

To aid in program readability, three of the address registers can also be 
referenced by special names. These are the Stack Pointer (SP) , Argument 
Pointer (AP) , and Frame Pointer (FP) . 

The assembler itself uses reserved words to denote the registers in the 
machine. Each register is referenced by the reserved words only; hence 
additional symbols may not be defined to denote machine registers. 

There are also ten special purpose registers — items 8 through 16 listed in 
the Register Syntax Summary Table below — in the machine that are used for 
machine control. These registers are specified by reserved words. 

Immediate Operands 

Immediate operands provide a method for referencing data in the program's 
instruction stream. The assembler syntax for specifying an immediate 
operand is written as "#" followed by the expression that defines the value 
of the immediate operand in question. 

MemorV Addressing Msdes 

In addition to the register and immediate addressing modes, there are six 

modes for specifying operands in memory. In general, these modes are used 

for loading registers from memory and for storing the contents of a regis- 
ter in memory. 

Absolute Addressing Mode 

A program can reference an absolute address in the virtual address space by 
specifying the location desired. This specification can be done by using 
an expression that evaluates to the address of the location desired: 

ld.w addr,reg 

Register Deferre d Made 

Via the register deferred mode, the address registers can contain the 
address of an operand to be used by the instruction. Thus, the register 
contains a pointer to the operand rather than the operand itself. This 
addressing mode is specified by enclosing the address register that con- 
tains the pointer in parentheses: 

ld.w (INDEX reg) , reg 

Indexed Made 
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The indexed mode adds an offset or base to the contents of the address 
register specified and uses the result as a pointer to the operand. This 
addressing mode is formed by preceding a deferred register specifier with 
an expr es s i on : 

ld.w addr (INDEX reg) , reg 

Indirect Aksclnte Mode 

This mode provides for one level of indirection from an absolute address.. 
The operand pointer is located at the absolute address specified by an 
expression. This addressing mode is specified by preceding an expression 
for the absolute location by the character "(§" . 

ld.w ©addr, reg 

Indirect Deferred Mods 

This mode provides an additional level of indirection over the deferred 
mode. The register specified contains the address of the pointer to the 
desired operand. The character "@" , followed by a deferred register 
operand, specifies this addressing mode. 

ld.w @ (INDEX reg), reg 

Indirect Indexed Mode 

Indirect indexed mode is denoted by preceding an indexed operand by the 
character "@" . In this mode, the value of the register and the value of 
the index expression are added together to form the address of a pointer to 
the desired operand: 

ld.w @ addr (INDEX reg), reg 
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Table E-l: Register Syntax Summary 

Symbolic Assembled Mode 

1. A or a Address Register Set 

2 - S or s Scalar Register Set 

3 - V or v Vector Register Set 

4. SP or sp Stack Pointer (AO) 

5. AP or ap Argument Pointer (A6) 

6. FP or fp Frame Pointer (A7) 

7. VS or vs Vector Stride Register 

8. VLor vl Vector Length Register 

9. VLS or vis Vector Stride and Length Combination 

10. PSW or psw Processor Status Word 

11. PC or pc Program Counter 

12. ITR or itr Interval Timer Register 

13. ITSR or itsr Interval Timer Status Register 
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Table E-2: Addressing Modes Syntax Summary- 
Assembled Mode 
immediate 



Symbolic 


1. 


#n 


2. 


n 


3. 


R 


4. 


(Rn) 


5. 


n(Rn) 


6. 


@n 


7. 


@(Rn) 


8. 


@n (Rn) 



absolute or PC 
relative 

register 

register deferred mode 

indexed mode 

indirect absolute mode 

indirect deferred mode 

indirect indexed mode 
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APPENDIX F 
GLOSSARY OF TECHNICAL TERMS USED IN THIS MANUAL 

Access Mode 

Any of the five processor access modes in which software executes. On 
the CONVEX system, processor access modes are: (in order from most to 
least privileged and protected) : kernel (mode 0) , executive (mode 1) , 
supervisor (mode 2) , agent (mode 3) and user (mode 4) . The operating 
system uses access modes to define protection levels for software exe- 
cuting in the context of a process. 

Accumulator 

A hardware register. This register contains the results of arithmetic 
and logical operations. 

Address 

User assigned number used by the operating system to identify a 
storage location. 

Addressing mode 

How the effective address of an instruction operand is calculated 
using the general registers. 

Address space 

Address space, either physical or virtual, available to a process. 

Address Translation Fault 

An exception that results from a PTE violation or a non-resident page. 

Address Translation Unit 

The address translation unit (ATU) translates logical addresses to 
physical addresses and stores them in a cache. Thus the ATU is an 
address cache which accelerates the generation of physical addresses. 

Architecture 

The conceptual structure and functional behavior of the system. 

Argument Pointer 

An address register specifically dedicated to point to the subroutine 
argument portion of a program. This program portion can either be in 
the stack or in part of logical memory pre-al located by the compiler. 

Arrays 

An ordered structure of operands of the same data type. The structure 
of an array is defined as: length, rank or dimension, stride, and data 
type. 

Base- level interrupt 

An interrupt which occurs when the kernel stack is the process stack. 
A base-level interrupt is thus an interrupt which occurs when no other 
interrupts are pending or currently being processed. 

Bit 

A binary digit. 
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Bit complement 

Exchanging O's and l's in the binary representation of a number (also 
known as l's complement). 

Bootstrap 

The procedure by which a program is initiated the first time. Typi- 
cally a bootstrap is performed when power is first applied to the pro- 
cessor. 

Branch 

A class of instructions used to transfer control of a program, specif- 
ically relative to the Program Counter. 

Breakpoint 

An instruction which aids in the debugging of a program. In particu- 
lar, a breakpoint is a particular location in a program that one would 
desire to determine the various values of programmer -de fined vari- 
ables. 

Byte (b) 

A byte is a number of contiguous bits starting on an addressable byte 
boundary. In CONVEX machines, a byte is eight bits. 

C 

The systems programming language of the UNIX operating system. 

Cache 

(See Logical , Physical , Instruction) . 

Cache memory 

A small, high-speed memory placed between main memory and the proces- 
sor and transparent to the user. CONVEX computers contain many 
separate caches. 

Cache purge 

The act of invalidating or removing entries in a cache memory. 

Central processing unit 

The central processing unit is the portion of a CONVEX machine which 
recognizes and executes the instruction set. 

Chaining 

Chaining is the ability to overlap vector operations in the central 
processing unit. For instance, in the case of a vector load followed 
by a vector add, the add may be started as soon as the first operands 
are available rather than waiting for the load to complete. 

Compiler 

Software tool used to compile a high-level language (e.g., Fortran) 
into assembly code. 

Context (processor) 

The entire, current state of the machine associated with the executing 
process . 
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Data type 

The way in which bits are grouped and interpreted. For processor 
instructions, the data type identifies the size of the operand and the 
significance of the bits in the operand. 

Destination 

The operand specified in an instruction which receives the result of 
the operation. 

Disp 1 acement 

A derived 32-bit value used to indicate the distance in bytes that the 
referenced datum is relative to some base value. This base value can 
either be or the contents of an address register. Please note that 
16-bit displacement values are sign extended to 32 bits. 

Double (d) 

A double precision floating point number, stored in 64 bits. 

Exception 

An exception is a hardware-detected event which disrupts the running 
of a program, process, or system. 

Fault 

An exception, which, while halting the instruction, leaves the regis- 
ters and memory in a consistent state. The instruction can often 
resume its course when the cause of the fault is corrected. 

Elag 

A 1-bit operand that is generally used to indicate the results of an 
operation. The results are in the form true or false. 

Floating point 

A numerical representation. A floating point operand has a sign 
(positive or negative, an exponent, and a fraction) . The fraction is 
a fractional representation. The exponent is the value used to pro- 
duce a power of two scale factor that is subsequently used to multiply 
the fractions to produce an unsigned value. 

FORTRAN 

High-level software language mainly used for scientific applications. 

Fraction 

A part of a floating point number. The fraction is the unsigned frac- 
tional part that denotes the magnitude of the operand. 

Frame 

See Page Erame, Stack Frame 

Function unit 

A function unit is a part of the central processing unit (CPU) which 
performs a set of operations on quantities stored in registers. 

Gate array 

A structure that is used by the ring protection mechanism. The gate 
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array defines the entry points from a lower privileged ring to a 
higher privileged ring. 

Gather 

Loading a vector register using another vector of indices instruction. 
See the ldvi instruction. 

Guard bit 

A bit to the right (the least significant bit positions) of a floating 
point fraction. The guard bit is used in intermediate calculations 
using floating point operands. 

Half word (h) 

Two bytes (16 bits) 

HUFFMAN'S encoding 

A binary encoding that results in the densest packing of information. 

I cache 

See Instruction Cache. 

Immediates 

Operands which are contained within the instruction stream. 

Indexing 

The process of adding a displacement to the contents of an address 
register . 

Indirection 

The process of obtaining the address of an operand by first referenc- 
ing a word contained within memory. 

Instruction 

Used by the programmer to direct operations on the systems' register 
set and memory. 

Instruction cache (ICACHE) 

The I -Cache contains the most recently accessed instructions. The I- 
Cache accelerates the decoding of instructions. This permits the 
simultaneous decoding on one instruction with the execution of another 
instruction . 

Interrupt 

An occurrence other than an exception which changes the normal flow of 
instruction execution. An interrupt originates from hardware, such as 
an I/O device. 

Interval timer 

A privileged register. The interval timer is used to generate an 
interrupt based on the passage of a period of time. 

Kernel 

A part of the operating system that resides in ring 0. The kernel 
typically manages process creation and deletion, scheduling, and other 
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high level, system wide features. 

Linker 

A software tool. The linker "links" together separate software 
modules into one monolithic module. 

Loads 

A class of instructions which move data from memory to a register. 

Locality of reference 

An attribute of a memory reference pattern. Locality of reference 
refers to the likelihood of an address of a memory reference being 
numerically close to a recent memory reference address, or the likeli- 
hood of a subsequent memory reference being identical to a previous 
memory reference within a given period of time. 

Logical address 

Logical address space is that space seen by the application program- 
mer . 



Logical cache 

The logical cache is a cache that is accessed with logical addresses 
for fast retrieval of data. It resides in the central processing 
unit. 

Logical memory 

Logical or virtual memory is that memory seen by the programmer. The 
logical memory of a CONVEX computer is 4 Gigabytes. 

Longword (1) 

Eight bytes (64 bits) , the largest integer data type directly sup- 
ported by hardware in the CONVEX-1. 

LSI (Language Specific Information) 

The area in the stack that is created as part of a subroutine call. 
It is Ian gauge dependent and may be zero. 

Machine exceptions 

Machine exceptions include fatal errors in the system which cannot be 
handled by the operating system. (See Exception). 

Main memory 

See Physical Memory. 

Maskable interrupt 

An interrupt that is masked out . That is , an interrupt that the 
operating system wishes, at this time, not to respond to. 

Memory management 

The hardware and software features which control page mapping and pro- 
tection. 

Microcode 
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A control program that resides within the central processing unit. 
Microcode also refers to firmware, providing the necessary control 
that maps assembly language instructions onto processor hardware. 

Modified bit 

A bit within the central processing unit. The modified bit records 
all valid write references to pageframes. The modified bit is used by 
the operating system for memory management. 

Negate 

An instruction which performs a 2's complement. 

Normalization 

The process of left shifting a fraction until the leading bit is a 1. 

Opcode 

The code or sequence of bits in an instruction which determines the 
operation to be performed. 

Operand 

A register or memory location referenced by an instruction. 

Orthogonality 

A characteristic that pertains to the relationship of instructions and 
the operands they manipulate. An instruction set is orthogonal if one 
can change one property without having to change other related proper- 
ties. 

4 

Packets 

A group of related items. A packet may refer to the subroutine argu- 
ments or to a group of bytes that is transmitted over a network. 

Page 

A page is the unit of logical memory controlled by the memory manage- 
ment algorithms. In CONVEX- 1; a page is 4 K (4096) contiguous bytes. 

Page fault 

An exception caused by a reference to a valid non-existent page. 

Page Frame 

A page frame is the unit of physical (main) memory in which pages are 
placed. Associated with each pageframe are referenced and modified 
bits to aid in memory management . 

Page Table Entry (PTE) 

An entry in a page table. A PTE is a word. A PTE contains various 
flags and fields that are used in the translation of logical to physi- 
cal addresses. Address translation uses two levels of page table 
indexing. The first level page table is referenced using bits 28 
through 22 of a logical address. This is called the Index. 1 field. 
The second level page table is referenced using bits 21 through 12 of 
a logical address. This is called the Index. 2 field. See Figure 5-4. 

Physical address 
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Hardware -identified address in physical (main) memory consisting of a 
page frame number and the number of a byte within the page. 

Physical cache 

The physical cache provides rapid access to recently used physical 
memory data items. 

Pipelining 

A technique used to construct high performance processors. Pipelining 
provides a means by which multiple operations occur concurrently. 

Porting 

Moving software from one type of machine to another. 

Priority 

An ordering of events. Priority is applied to protection levels as 
well as I/O interrupt levels. 

Privileged instruction 

An instruction used by the operating system or privileged systems pro- 
grams. It must execute in ring 0, or an exception occurs. 

Process 

A process is the fundamental unit of program which is managed by the 
j ob schedu 1 er . 

Process exceptions 

Process exceptions belong to the currently running process and may be 
handled with an exception handler in that process. The exception 
handler is in the current ring of execution. (See Exception) . 

Protection 

A mechanism provided by hardware and software. Protection is used to 
ensure that one user is protected from another user or to ensure that 
a user does not perform an unsafe computation. 

Processor Status Word (PSW) 

A word that contains control flags. The PSW is used to control and 
indicate the state of various computations and sequences within the 
processor . 

Push 

The act of storing an operand on the stack. 

Queue 

A data structure in which entries are made at one end and deletions at 
the other. Often referred to as first-in first-out or FIFO. 

Quotient 

The result of a division operation. 

Read 

A memory operation in which the contents of a memory location are 
accessed and passed to another part of the machine. 
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Recursion 

An arithmetic operation that uses the output of a calculation as the 
input of the same calculation. 

Reduced Instruction Set Computer (RISC) 

An architectural concept that applies to the definition of the 
instruction set of a processor. A RISC instruction set is an orthogo- 
nal instruction set that is easy to decode in hardware and for which a 
compiler can generate highly optimized code. 

Reductions 

An arithmetic operation that performs a transformation on an array 
that produces a scalar result. 

Register 

A hardware entity that is used to contain addresses, operands, and 
status . 

Reservation 

Reservation is the process of managing the various function units in 
the central processing unit. A reservation table is used to record 
the current status and availability of the function units. 

Reset 

The process of establishing a known state in a machine register. 

Rings 

A ring is the unit of logical memory used for protection purposes. 
There are five rings in CONVEX machines: four for system level usage 
and one for users. The system rings (Ring0-Ring3) each correspond to 
one Segment of logical memory, while the user ring (Ring4) contains 
four segments . 

Ring Maximization 

Ring maximization is the mechanism used to enforce protection in the 
logical address space. 

Round bit 

One of the two guard bits used in the Intermediate representation of 
a floating point number. 

Rounding 

The process of transforming the intermediate representation of a 
floating point number to the memory representation. Unbiased rounding 
uses the round, guard, and sticky bits to determine the exact nature 
of this transformation. Truncation (as used in converting floating 
point to fixed point integer) does not use the round, guard, or sticky 
bits . 

Runtime 

A software module. A runtime is a software module that is referenced 
as a procedure. A runtime represents a required function that is not 
directly supported by the hardware, but it is required by the 
software. 
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Scatter 

Storing a vector register using another vector of indices. See the 
stvi instruction . 

Segmented ALU 

A logic design technique that permits multiple arithmetic operations 
of the same type to be pipelined. 

Segment 

The segment is the basic partition of the logical memory space. A 
segment is 512 megabytes. 

Segment descriptor register 

Each segment of virtual memory has a segment descriptor register asso- 
ciated with it. Each SDR contains information pertinent to the access 
and mapping of virtual addresses. 

Shift 

A class of instructions used to shift the contents of a register right 
or left. 

Single (s) 

A single precision floating point number stored in 32 bits. 

Source 

A register or memory location used as an input to a CONVEX instruc- 
tion. 

Spatial reference 

An attribute of a memory reference pattern. Spatial reference per- 
tains to the likelihood of a subsequent memory reference address 
being numerically close to a previous address. 

Stack 

A data structure in which the last item entered is the first to be 
removed. Also referred to as last-in first-out (LIFO) . In particu- 
lar, stacks are used by the Call and Return instructions. 

Sticky bit 

A bit used in the intermediate calculations of floating point 
operands. The sticky bit remembers if any binary l's were shifted out 
during an alignment or partial product operation. 

Stores 

A class of instructions used to move the contents of registers to 
memory . 

Subroutine 

A software module. A subroutine is a frequently used program that is 
called from various places in a program. 

System exceptions 

System exceptions cannot be handled by the current process; they 
require intervention by the kernel executing in ring 0. (See 
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Exception) . 

Trace of instruction execution 

The process of tracking the execution of every instruction of a pro- 
gram . 

Trap 

An out of sequence branch due to the occurrence of an abnormal condi- 
tion. Typically, this condition is a result of unexpected arithmetic 
results. (See Exception.) 

Trojan Horse Pointer 

The Trojan Horse Pointer is an addresss that is passed from one ring 
to another as part of a system call. In particular, this passed 
pointer references the more privileged ring as contrasted to the less 
privileged ring. This is unexpected and undesirable. 

True Zero 

A floating point number with zero sign bit, zero exponent, and zero 
fraction . 

Unbiased rounding 

The process of interpreting the round, guard, and sticky bits. 
Unbiased rounding, as contrasted to biased rounding, rounds to even in 
the event that the intermediate floating point result is exactly mid- 
way between two floating point representations . 

UNIX 

An operating system. 

Unsigned 

A value that is always positive. 

Valid bit 

A bit used in the control of caches. The valid bit is used to deter- 
mine if a cache entry contains an entry that can be used. 

Valid reference 

A valid reference meets two requirements: first, the PTE must be 
valid (bit 31=1) , and second, the type of access being made (Read, 
Write, or Execute) must be allowed by the appropriate protection bit 
(bits <3..1> of the PTE). 

Vector 

An array with one dimension. 

Virtual address space 

See Logical Address Space. 

Word 

Four bytes (32 bits) --the fundamental width of items in the CONVEX 
family of computers. 

Working set 
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That portion of a user's program that is currently in physical memory. 
Typically the working set is much smaller that the user program. 

Write 

A memory operation in which a memory location is updated with new 
data . 

Zero 

In floating point number representations, zero is represented by a 
zero sign bit and zero exponent. 
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